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SECOND OR PARALLEL? OR MULTIPLE? 

52 3685657 RECORD OR RECORDS OR DATA OR FILE OR FILES OR INFORMATION 

OR CITATION?. 

53 368 8754 RECORD OR RECORDS OR DATA OR FILE OR FILES OR INFORMATION 

OR CITATION? 

54 3789485 REMOV? OR DELETE? OR DELETING? OR ERASE? ? OR ERASING? OR - 

DEDUPE? ? OR CHECK? OR FIND???? OR IDENTIF? OR DETECT? OR C- 
OMPAR???? OR MATCH? OR PREVENT? 

55 81570 S1(3N)S3 

56 4991 S5(5N)S4 

57 2876186 MERGE? OR MERGING? OR CONSOLIDAT? OR COMBIN? OR JOIN??? OR 

GROUP? OR COMPRESS? OR COMPOSITE? 

58 3357 S7(5N)S5 

59 250 S6 AND S8 

510 98 0056 MEDICAL? OR PATIENT? OR HEALTH? OR HOSPITAL? OR CLINICAL? 

511 30 S10 AND S9 

512 29 RD (unique items) 

513 27 S12 NOT PY>2001 

514 38 S9 AND (DATABASE? OR DATA ( ) BASE? OR DATA (2N) (WAREHOUS? OR - 

WAREOHOUS? OR MINE? ? OR MINING?) OR DATAMIN? OR DB OR DBS OR 
RECORD? () MANAGE?) 

515 37 RD (unique items) 

516 33 S15 NOT PY>2001 
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6994728 INSPEC Abstract Number: C2001-09-7330-203 
Title: LIFECODE: a deployed application for automated medical coding 

Author (s): Heinze, D.T.; Morsch, M.L.; Sheffer, R.E.; Jimmink, M.A. ; 
Jennings, M.A. ; Morris, W.C.; Morsch, A.E.W. 

Journal: AI Magazine vol.22, no. 2 p. 76-88 

Publisher: American Assoc. Artificial Intelligence, 

Publication Date: Summer 2001 Country of Publication: USA 

CODEN: AIMAEK ISSN: 0738-4602 

SICI: 0738-4 602 (200122 ) 22 : 2L. 76 : LDAA; 1-X 

Material Identity Number: H885-2001-003 

U.S. Copyright Clearance Center Code: 0738-4 602/2001/$2 . 00 
Language: English Document Type: Journal Paper (JP) 
Treatment: Practical (P) 

Abstract: LIFECODE is a natural language processing (NLP) and expert 
system that extracts demographic and clinical information from free-text 

clinical records. The initial application of LIFECODE is for the 
emergency medicine clinical specialty. An application for diagnostic 
radiology went into production in October 2000. The LIFECODE NLP engine 
uses a large number of specialist readers whose particular output are 
combined at various levels to form an integrated picture of the patient 1 s 

medical condition ( s ) , course of treatment, and disposition. The LIFECODE 
expert system performs the tasks of combining complementary information 
, deleting redundant information / assessing the level of medical 

risk and level of service represented in the clinical record, and 
producing an output that is appropriate for input to an electronic medical 

record (EMR) system or a hospital information system. Because of the 
critical nature of the tasks, LIFECODE has a unique "self-awareness" 
feature that enables it to recognize the limits of its competence and, 
thus, ask for assistance from a human expert when faced with information 
that is beyond the bounds of its competence. The LIFECODE NLP and expert 
systems reside in various delivery packages, including online transaction 
processing, a web browser interface, and an automated speech recognition 
(ASR) interface. (26 Refs) 
Subfile: C 

Descriptors: medical expert systems; medical information systems; 
natural languages 

Identifiers: LIFECODE; natural language processing; expert system; 
emergency medicine; diagnostic radiology; free-text clinical records; 
LIFECODE NLP engine; LIFECODE expert system; electronic medical record; 
hospital information system. ; online transaction processing; web browser 
interface; automated speech recognition 

Class Codes: C7330 (Biology and medical computing); C6180N (Natural 
language processing) ; C6170 (Expert systems and other AI software and 
techniques); C7140 (Medical administration) 
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6911594 INSPEC Abstract Number: C2001-06-7330-047 

Title: LifeCode/sup TM/-a natural language processing system for medical 
coding and data mining 

Author (s): Heinze, D.T.; Morsch, M.L.; Sheffer, R.E., Jr.; Jimmink, M.A. ; 
Jennings, M.A. ; Morris, W.C.; Morsch, A.E.W. 

Author Affiliation: A- Life Med. Inc., San Diego, CA, USA 

Conference Title: Proceedings Seventeenth National Conference on 
Artificial Intelligence (AAAI-2000) . Twelfth Innovative Applications of 
Artificial Intelligence Conference (IAAI-2000) p. 965-72 

Publisher: AAAI Press, Menlo Park, CA, USA 



Publication Date: 2000^^untry of Publication: USA ^Bc+1190 pp. 

ISBN: 0 262 51112 6 ^teterial Identity Number: XX-20oWo2069 
Conference Title: Proceedings of the Seventeenth National Conference on 

Artificial Intelligence 

Conference Sponsor: American Assoc. Artificial Intelligence 

Conference Date: 30 July-3 Aug. 2000 Conference Location: Austin, TX, 

USA 

Language: English Document Type: Conference Paper (PA) 
Treatment: Practical (P) 

Abstract: LifeCode/sup TM/ is a natural language processing and expert 
system that extracts demographic and clinical information from free-text 

clinical records. The initial application of LifeCode is for the 
emergency medicine clinical specialty. An application for diagnostic 
radiology is now in beta-test. A pilot program for performing data mining 
on acute care clinical records has been completed. The LifeCode NLP 
engine uses a large number of specialist readers whose particular outputs 
are combined at various levels to form an integrated picture of the 

patient 's medical condition ( s ) , course of treatment and disposition. 
The LifeCode expert system performs the tasks of combining complementary 

information , deleting redundant information , assessing the level of 

medical risk and level of service represented in the clinical record 
and producing an output that is appropriate for input to an electronic 

medical record (EMR) system or a billing system. Because of the critical 
nature of the tasks, LifeCode has a unique . "self-awareness " feature that 
enables it to recognize the limits of its competence and thus ask for 
assistance from a human expert when faced with information that is beyond 
the bounds of its competence. The LifeCode NLP and expert systems are 
wrapped as DCOM servers and reside in various delivery packages including 
On-Line Transaction Processing (OLTP) , a Web-browser interface and an 
automated speech recognition (ASR) interface. (24 Refs) 
Subfile: C 

Descriptors: data mining; medical administrative data processing; 
medical diagnostic computing; medical expert systems; natural language 
interfaces 

Identifiers: natural language processing system; data mining; expert 
system; demographic information; free-text clinical records; diagnostic 
radiology; LifeCode NLP engine; medical condition; treatment course; 
patient disposition; complementary information; redundant information; 
electronic medical record system; EMR system; billing system; 
self-awareness feature; DCOM servers; delivery packages; On-Line 
Transaction Processing; OLTP; online transaction processing; Web-browser 
interface; automated speech recognition interface; ASR) interface 

Class Codes: C7330 (Biology and medical computing); C7140 (Medical 
administration) ; C6170 (Expert systems and other AI software and 
techniques); C6180N (Natural language processing); C6130 (Data handling 
techniques); C6170K (Knowledge engineering techniques); C6160 (Database 
management systems (DBMS) ) 
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6396305 INSPEC Abstract Number: C1999-12-7140-022 
Title: Intelligent data clean-up in epidemiological registers 

Author(s): Hinrichs, H. 

Author Affiliation: Oldenburger Forschungs- und Entwicklungsinst . fur 
Inf.- Werkzeuge und -Syst., Germany 

Conference Title: Datenbanksysteme in Buro, Technik und Wissenschaft 
(Database Systems in Office, Technology and Knowledge Work) p. 291-306 

Editor (s): Buchmann, A. P. 

Publisher: Springer-Verlag, Berlin, Germany 

Publication Date: 1999 Country of Publication: Germany x+465 pp. 
ISBN: 3 540 65606 5 Material Identity Number: XX-1999-00502 

Conference Title: Proceedings of Datenbanksysteme in Buro Technik und 
Wissenschaft. BTW '99. (Database systems in the office technology and 



research) 

Conference Date: l-3^^arch 1999 Conference Loca^ron: Freiburg im 
Breisgau, Germany 

Language: German Document Type: Conference Paper (PA) 
Treatment: Practical (P) 

Abstract: Notes the heterogeneous character of medical epidemiological 
data, so that quality has to be improved before processing. Use of data 
compression , and removal of redundant data , are considered. 

Announces the "CARELIS 11 computer program (Cancer Registry Lower Saxony 
Record Linkage System) which integrates probabilistic record linkage and 
knowledge processing in order to reduce the amount of manual pre-processing 
required. The steps in cleaning-up data in cancer information are described 
and record standardisation and record linkage are explained. A block 
diagram of the program is presented and knowledge based message processing 
is considered. Applications where data is derived from a data warehouse are 
discussed. (17 Refs) 

Subfile: C 

Descriptors: cancer; data compression; data warehouses; knowledge based 
systems; medical information systems; records management 

Identifiers: intelligent data clean-up; epidemiological registers; 
heterogeneous character; medical epidemiological data; data compression; 
redundant data; CARELIS computer program; Cancer Registry Lower Saxony 
Record Linkage System; probabilistic record linkage; knowledge processing; 
cancer information; record standardisation; knowledge based message 
processing; data warehouse 

Class Codes: C7140 (Medical administration); C6170 (Expert systems and 
other AI software and techniques); C6130 (Data handling techniques); 
C6160Z (Other DBMS) 

Copyright 1999, IEE 
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6049745 INSPEC Abstract Number: C9811-7140- 106 

Title: Combining laboratory data sets from multiple institutions 
using the Logical Observation Identifier Names and Codes (LOINC) 

Author (s): Baorto, D.M.; Cimino, J.J.; Parvin, C.A.; Kahn, M.G. 

Author Affiliation: Dept. of Pathology, Washington Univ. Sch. of Med., 
St. Louis, MO, USA 

Journal: International Journal of Medical Informatics vol.51, no . 1 
p. 29-37 

Publisher: Elsevier, 

Publication Date: July 1998 Country of Publication: Ireland 

CODEN: IJMIF4 ISSN: 1386-5056 

SICI: 1386-5056 (1998 07) 51: 1L. 29: CLDS;1-A 

Material Identity Number: G133-98008 

U.S. Copyright Clearance Center Code: 1386-5056/98/$19 . 00 
Document Number: S1386-5056 ( 98 ) 00089-6 

Language: English Document Type: Journal Paper (JP) 
Treatment: Experimental (X) 

Abstract: A standard set of names and codes for laboratory test results 
is critical for any endeavor requiring automated data pooling, including 
multi-institutional research and cross-facility patient care. This need 
has led to the development of the Logical Observation Identifier Names and 
Codes (LOINC) database and its test-naming convention. For this study, we 
required two medical centers to extract raw laboratory data from their 
local information system for a defined patient population, translate 
tests into LOINC and provide aggregate data which could then be used to 
compare laboratory utilization. We examine a larger number of tests from 
each site which have been recoded using an updated version of the LOINC 
database. We conclude that the coding of local tests into LOINC can often 
be complex, especially the * Kind of Property' field, and apparently trivial 
differences in choices made by individual institutions can result in 
nonmatches in electronically pooled data. 75% of failures to match the same 
tests between different institutions using LOINC codes were due to 



differences in local cHHig choices. LOINC has the pote^^fcl to eliminate 
the need for detailed numan inspection during the pooJ^ig of laboratory 
data from diverse sites and perhaps even a built-in capability to adjust 
matching stringency by selecting subsets of LOINC fields required to match. 
However, a quality standard coding procedure is required, and examples 
highlighted in this paper may require special attention while mapping to 
LOINC. (12 Refs) 
Subfile: C 

Descriptors: codes; database management systems; medical information 
systems; merging; nomenclature 

Identifiers: laboratory data sets; Logical Observation Identifier Names 
and Codes; LOINC database; laboratory test results; automated data pooling; 
multi-institutional research; cross-facility patient care; medical 
centers; raw laboratory data extraction; local information system; patient 

population; laboratory utilization; property type; local coding choices; 
detailed human inspection; matching stringency; database fields; quality 
standard coding procedure 

Class Codes: C7140 (Medical administration); C6130 (Data handling 
techniques); C6160Z (Other DBMS) 
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5787533 INSPEC Abstract Number: B9802-7540-002 , C9802-714 0-082 

Title: Medical images and electronic patient information: finding the 
link 

Author (s): Goel, H. 

Author Affiliation: IMNET Syst. Inc., USA 

Conference Title: Toward an Electronic Patient '97. Conference and 
Exposition. Proceedings Part vol.2 p. 234-5 vol.2 
Editor (s): Waegemann, CP. 

Publisher: Med. Records Inst, Newton, MA, USA 

Publication Date: 1997 Country of Publication: USA 3 vol. 

(387+324+379) pp. 

ISBN: 0 9640667 9 3 Material Identity Number: XX97-03129 

Conference Title: Proceedings of TEPR '97. Toward an Electronic Patient 

Record '97 

Conference Date: 27 April-3 May 1997 Conference Location: Nashville, 
TN, USA 

Language: English Document Type: Conference Paper (PA) 
Treatment: General, Review (G) 

Abstract: Medical images and electronic patient information have been 
separated by a seemingly impenetrable barrier. With the ever-growing 
influence of managed care, the race is now on to electronically link these 
vital sources of information. Traditional PACS solutions have come under 
increasing fire due to their proprietary nature and extremely high cost. 

Health care providers are all racing to establish a computerized patient 

record (CPR) , and they realize that medical images are an integral part 
of the CPR. They also realize that the disparate nature of current PACS 
solutions will not allow them to integrate medical images with the rest 
of their patient data. A link between the two must be found to make the 
CPR a reality. The solution is an integration between the enterprise's 

health care information system (HIS) and on-line medical images; an 
infoPACS solution. InfoPACS are critical to the information integration 
plans of today's health care providers. They supply the missing link 
between radiology and the rest of the enterprise by providing PACS image 
acquisition, display, and archiving components, and integration between 

medical images and electronic patient information. This allows users in 
radiology and throughout the enterprise to view every aspect of a patient 
's medical record at any time, from anywhere. By combining all patient 
information together in the same database, infoPACS solutions remove 

a major barrier to true enterprise-wide integration. (0 Refs) 
Subfile: B C 

Descriptors: integrated software; medical information systems; PACS 



Identifiers: medical ^Jages; electronic patient inf^^ktion; managed 
care; PACS; computerized patient record; health care information system 
; infoPACS; information integration; PACS image acquisition; archiving; 
medical record; patient information; enterprise-wide integration 

Class Codes: B7540 (Hospital Engineering); B7510B (Radiation and 
radioactivity applications in biomedicine) ; B6210L (Computer communications 
); C7140 (Medical administration); C5260B (Computer vision and image 
processing techniques); C5620L (Local area networks); C7330 (Biology and 
medical computing) 
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01134925 INSPEC Abstract Number: C78001815 

Title: An experiment to measure the performance of phonetic key- 
compression retrieval schemes 

Author{s): Greenfield, R.H. 

Author Affiliation: Washington Univ. School of Medicine, St. Louis, MO, 
USA 

Journal: Methods of Information in Medicine vol.16, no. 4 p. 230-3 
Publication Date: Oct. 1977 Country of Publication: West Germany 
CODEN: MIMCAI ISSN: 0026-1270 

Language: English Document Type: Journal Paper (JP) 
Treatment: Experimental (X) 

Abstract: The results of an experiment to measure the performance of the 
Davidson and Soundex phonetic key compression schemes in finding sets 
of records representing the same individual in a moderately large 
radiology patient file as compared to exact surname matches, are 
presented. This is similar to the problem of retrieving a record by name 
under the assumption that neither the search key nor the recorded key is 
accurately known. Both phonetic schemes perform similarly in obtaining 
extra matches and both outperform by a large margin the exact name match. 
The results also indicate that the Davidson scheme is superior to the 
Soundex because it produces significantly fewer mismatches. (7 Refs) 

Subfile: C 

Descriptors: data handling; medical administrative data processing 
Identifiers: experiment; measure; performance; phonetic key compression; 

retrieval schemes; Davidson; Soundex; records; radiology patient file 
Class Codes: C6130 (Data handling techniques); C7140 (Medical 
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1301411 

An experiment to measure the performance of phonetic key compression 
retrieval schemes . 

Author(s): Greenfield, R H 

Corporate Source: Washington University School Of Medicine, St. Louis, 
Missouri . 

Methods of Information in Medicine vol. 16, no. 4, pages 230-233 

Publication Date: 1977 

ISSN: 0026-1270 

Language: English 

Document Type: Journal Article 

Record Type: Abstract 

Journal Announcement: 1300 

The results of an experiment to measure the performance of the davidson 
and soundex phonetic key compression schemes in finding sets of 
records representing the same individual in a moderately large radiology 
patient file, as compared to exact surname matches, are presented. This 



is similar to the problel^^E retrieving a record by name ^^^ r the 
assumption that neither the search key nor the recorded key is accurately 
known. Both phonetic schemes perform similarly in obtaining extra matches 
and both outperform by a large margin the exact name match. The results 
also indicate that the davidson scheme is superior to the soundex because 
it produces significantly fewer mismatches. 

Classification Codes and Description: 5.11 (Searching and Retrieval) 

Main Heading: Information Processing and Control 
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01603431 ORDER NO: AAD98-04033 

ADAPTIVE DETECTION OF APPROXIMATELY DUPLICATE DATABASE RECORDS AND 
THE DATABASE INTEGRATION APPROACH TO INFORMATION DISCOVERY 

Author: MONGE, ALVARO EDMUNDO 
Degree: PH.D. 
Year: 1997 

Corporate Source/Institution: UNIVERSITY OF CALIFORNIA, SAN DIEGO (0033) 
Chairperson: CHARLES P. ELKAN 

Source: VOLUME 58/08-B OF DISSERTATION ABSTRACTS INTERNATIONAL. 

PAGE 4315. 98 PAGES 
Descriptors: COMPUTER SCIENCE ; INFORMATION SCIENCE 
Descriptor Codes: 0984; 0723 

The integration of information is an important area of research in 
databases . By combining multiple information sources, a more 
complete view of the world is attained, and additional knowledge gained. 
This is a non-trivial task however. Often there are many sources which 
contain information about a certain kind of entity, and some will contain 
records concerning the same real-world entity. Thus, one problem in 
.integrating information sources is to identify possibly different 
designators of the same entity. This thesis provides solutions to this data 
cleansing problem. The integration of information sources is also proposed 
as an approach for information retrieval over the worldwide web. 

Data cleansing is the process of purging databases of inaccurate or 
inconsistent data. The data is manipulated into a form which is useful for 
other tasks, such as data mining . This thesis addresses the data 
cleansing problem of detecting database records that are approximate 
duplicates , but not exact duplicates. An efficient algorithm is presented 
which combines three key ideas. First, the Smith-Waterman algorithm for 
computing the minimum edit-dis tance is used as a domain-independent method 
to recognize pairs of approximately duplicates. Second , the union- find 
data structure is used to maintain the clusters of duplicate records 
incrementally, as pair-wise duplicate relationships are discovered. Third, 
the algorithm uses a priority queue of cluster subsets to respond 
adaptively to the size and homogeneity of the clusters discovered as the 
database is scanned. This results in significant savings in the number of 
times that a pairwise record matching algorithm is applied, without 
impairing accuracy. Comprehensive experiments on synthetic databases and 
on a real-world database confirm the effectiveness of all three ideas. 

This thesis also presents W scEBF scIND, an application that discovers 
scientific papers online over the worldwide web. The size of the web makes 
online information retrieval difficult. In such a setting, it is critical 
to know where to concentrate the search for information. W scEBF scIND uses 
a domain-independent algorithm to match records from different sources for 
the purpose of integrating the information in those sources. We describe 
the design of W scEBF scIND, the integration process, and discovery phase. 
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6796252 INSPEC Abstract Number: C2 001-02-7330-065 
Title: Mouse genome informatics in a new age of biological inquiry 
Author (s): Bult, C.J.; Richardson, J.E.; Blake, J. A.; Kadin, J. A. ; 
Ringwald, M. ; Eppig, J.T.; Baldarelli, R.M. ; Baya, M. ; Beal, J.S.; Begley, 
D.A.; Boddy, W.J.; Bradt, D.W. ; Butler, N.E.; Chu, T . ; Corbani, L.E.; 
Corradi, J.; Davisson, M.T.; Garippa, D. ; Glass, L.H.; Grant, P.L.; Hill, 
D.P.; King, B.L.; Krupke, D.M.; Lennon-Pierce, M. ; Lutz, CM. ; Maltais, 
L.J.; Mani, P.; McCright, I.; McKenzie, L. ; Naf, D. ; Ormsby, J.E.; 
Ramachandran, S.; Reed, D.J.; Shaw, D.R.; Szauter, P.; Trombley, L.A. 



Author Affiliation: Ja^^m Lab., Bar Harbor, ME, USA 

Conference Title: Proceedings IEEE International Symposium on 

Bio-Informatics and Biomedical Engineering p. 29-32 
Publisher: IEEE Comput. Soc, Los Alamitos, CA, USA 
Publication Date: 2000 Country of Publication: USA xi+386 pp. 
ISBN: 0 7695 0862 6 Material Identity Number: XX-2000-02862 

U.S. Copyright Clearance Center Code: 0 7695 0862 6/2000/$ 10 . 00 
Conference Title: Proceedings IEEE International Symposium on 

Bio-Informatics and Biomedical Engineering 

Conference Sponsor: IEEE Comput. Soc. ; IEEE Comput. Soc. Task Force on 

Virtual Intelligence (TFVI); IEEE Eng. Med. & Biology Soc; EMB; AIIS; 

ICIIS; ICTAI; NIH; NCI; TIGR 

Conference Date: 8-10 Nov. 2000 Conference Location: Arlington, VA, 

USA 

Language: English Document Type: Conference Paper (PA) 
Treatment: Practical (P) 

Abstract: Data integration is key to knowledge discovery in the age of 
genomics and represents a major, long-standing challenge for the genome 
informatics community. Integration of data across heterogeneous genome 
databases requires the identification of common data entities and 
mechanisms to ensure referential integrity and persistence of these common 
entities even as our understanding of their biological properties changes. 
The Mouse Genome Informatics (MGI) database group employs multiple 
strategies for achieving data integration and maintaining 

inter-connectedness with other databases , including: (1) the use of 
permanent, unique accession IDs for identifying core data entities, (2) the 
application of nomenclature standards for naming genes and strains of mice, 
and (3) the development and implementation of controlled vocabularies and 
ontologies to ensure semantic consistency of biological concepts within and 
across model organism databases . (10 Refs) 

Subfile: C 

Descriptors: biology computing; data integrity; data mining ; 
distributed databases ; genetics; nomenclature; scientific information 
systems; standards; vocabulary; zoology 

Identifiers: mouse genome informatics; biological inquiry; data 
integration; knowledge discovery; genomics; heterogeneous genome databases 
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Title: Data redundancy and duplicate detection in spatial join 
processing 
Author(s): Dittrich, J. -P.; Seeger, B. 

Author Affiliation: Dept. of Math. & Comput. Sci., Marburg Univ., Germany 
Conference Title: Proceedings of 16th International Conference on Data 
Engineering (Cat. No . 00CB37 073 ) p. 535-46 

Publisher: IEEE Comput. Soc, Los Alamitos, CA, USA 

Publication Date: 2000 Country of Publication: USA xxii+703 pp. 
ISBN: 0 7695 0506 6 Material Identity Number: XX-2000-00609 

U.S. Copyright Clearance Center Code: 0 7695 0506 6/2000/$ 10 . 00 
Conference Title: Proceedings 16th International Conference on Data 
Engineering 

Conference Sponsor: IEEE Comput. Soc. Tech. Committee on Data Eng 
Conference Date: 29 Feb. -3 March 2000 Conference Location: San Diego, 
CA, USA 

Language: English Document Type: Conference Paper (PA) 



Treatment: Practical ( 




Experimental (X) 



Abstract: The partition-based spatial-merge join (PBSM) of J.M. Patel and 
D.J. DeWitt (1996) and the size separation spatial join (S/sup 3/J) of N. 
Koudas and K.C. Sevcik (1997) are considered to be among the most efficient 
methods for processing spatial (intersection) joins on two or more spatial 
relations. Neither method assumes the presence of pre-existing spatial 
indices on the relations. In this paper, we propose several improvements to 
these join algorithms. In particular, we deal with the impact of data 

redundancy and duplicate detection on the performance of these 

methods. For PBSM, we present a simple and inexpensive online method to 
detect duplicates in the response set. There is no longer any need to 
eliminate duplicates in a final sorting phase, as was originally suggested. 
We also investigate the impact of different internal algorithms on the 
total run-time of PBSM. For S/sup 3/J, we break with the original design 
goal and introduce controlled redundancy of data objects. Results of a 
large set of experiments with real data sets reveal that our suggested 
modifications to PBSM and S/sup 3/J result in substantial performance 
improvements, where PBSM is generally superior to S/sup 3/J. (33 Refs) 
Subfile: C 
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Author (s): Hammond, B. 

Journal: SIGMOD Record Conference Title: SIGMOD Rec. (USA) vol.28, 
no. 2 p. 527 
Publisher: ACM, 

Publication Date: June 1999 Country of Publication: USA 

CODEN: SRECD8 ISSN: 0163-5808 

SICI: 0163-5808 (199906) 28 : 2L . 527 : MRMS ; 1-0 

Material Identity Number: A660-1999-002 

U.S. Copyright Clearance Center Code: 0163-5808/99/$05 . . . $5 . 00 
Conference Title: 1999 ACM SIGMOD International Conference on Management 
of Data 

Conference Date: 1-3 June 1999 Conference Location: Philadelphia, PA, 
USA 

Language: English Document Type: Conference Paper (PA); Journal Paper 
(JP) 

Treatment: Practical (P) 

Abstract: SQL Server 7.0 offers three different styles of replication: 
transactional replication, snapshot replication, and merge replication. The 
author focuses on merge replication . This means that data changes can 
be performed at any replica, and that the changes performed at multiple 
replicas are later merged together. Because merge replication allows 
updates to disconnected replicas, it is particularly well suited to 
applications that require a lot of autonomy. A special process called the 
merge agent propagates changes between replicas , filters data as 
appropriate, and detects and handles conflicts according to 
user-specified rules. (0 Refs) 
Subfile: C 

Descriptors: client-server systems; merging; query processing; relational 
' databases ; replicated databases ; SQL 

Identifiers: Microsoft; SQL Server; transactional replication; snapshot 



replication; merge repli^^Lon; data changes; disconnecte^^^plicas ; merge 
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Class Codes: C6160D (Relational databases); C6140D (High level languages) 
; C6160B (Distributed databases); C6150N (Distributed systems software) 
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6006014 INSPEC Abstract Number: C9810-7250C-001 

Title: Duplicate detection and record consolidation in large 

bibliographic databases : the COPAC database experience 

Author(s): Cousins, S.A. 

.Author Affiliation: Manchester Comput., Manchester Univ., UK 
Journal: Journal of Information Science vol.24, no . 4 p. 231-40 
Publisher: Bowker-Saur for Inst. Inf. Sci, 
Publication Date: 1998 Country of Publication: UK 
CODEN: JIOSED ISSN: 0165-5515 
SICI: 0165-5515 (1998) 24: 4L. 231: DDRC; 1-1 
Material Identity Number: C262-98004 

Language: English Document Type: Journal Paper (JP) 
Treatment: Practical (P) 

Abstract: COPAC is a union catalogue giving access to the online 
catalogue records of some of the largest academic research libraries in the 
UK and Ireland. Like all union catalogues COPAC is supplied with multiple 
copies of records representing the same document in the contributing 
library catalogues. To reduce the level of duplication visible to the COPAC 
user, it has been necessary to develop duplicate detection and record 
consolidation procedures. These result in the production of a single 
record for each document, representing the holdings of several libraries. 
This paper discusses the ways in which both the duplicate detection and 
record consolidation procedures are carried out, along with the problem 
areas encountered. The general structure of these procedures is also 
described, providing a model of the duplicate record handling mechanisms 
used in COPAC. (6 Refs) 

Subfile: C 

Descriptors: academic libraries; bibliographic systems; cataloguing; 
library automation; research libraries; very large databases 

Identifiers: duplicate detection; record consolidation; large 
bibliographic databases ; COPAC database ; union catalogue; online 
catalogue records; academic research libraries; duplicate record handling 

Class Codes: C7250C (Bibliographic retrieval systems); C7210L (Library 
automation) ; C7240 (Information analysis and indexing) 
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Title: An object deputy model for realization of flexible and powerful 
objectbases 

Author(s): Kambayashi, Y. ; Zhiyong Peng 

Author Affiliation: Dept. of Inf. Sci., Kyoto Univ., Japan 
Journal: Journal of Systems Integration Conference Title: J. Syst. 
Integr. (Netherlands) vol.6, no. 4 p. 329-62 
Publisher: Kluwer Academic Publishers, 

Publication Date: Nov. 1996 Country of Publication: Netherlands 

CODEN: JSINE4 ISSN: 0925-4676 

SICI : 0925-4 67 6 (199611) 6: 4L. 329: ODMR; 1-J 

Material Identity Number: 0661-96005 

U.S. Copyright Clearance Center Code: 0925-4676/96/$8 . 50 

Conference Title: 4th International Conference on Database Systems for 
Advanced Applications 



Conference Date: 10-13^Uril 1995 Conference LocatioH^Singapore 

Language: English Document Type: Conference Paper (PA); Journal Paper 
(JP) 

Treatment: Theoretical (T) 

Abstract: In object-oriented (00) databases , data and methods are 
combined by objects and their classes. Information redundancies are 

removed by the notions of class hierarchy and inheritance. This 
contributes to the realization of high-performance systems. However, the 
flexibility of object structures is a problem, due to its encapsulation 
feature. We introduce an object deputy model which extends conventional 00 
models with the concepts of deputy objects and deputy classes. A deputy 
object has its own identifier and may possess its own attributes and 
methods. It can also have attributes that are computed from values stored 
within its source object, and can be associated with methods generated from 
these of the source object. The inheritances are realized by switching 
operations, which make it possible to realize controllable, selective and 
dynamic inheritance structures. Schemata of deputy objects are defined by 
deputy classes which can be derived by an object deputy algebra. An object 
can have many deputy objects, and several objects can share a single deputy 
object. Thus, objects can be indirectly divided and combined through their 
deputy objects. Several difficult database problems, such as flexible 
views, objects with more than one role, object migration and multiple 
inheritance, become much easier in this model. The data-knowledge 
coordination model developed for the integration of distributed databases 
and knowledge-bases can also be realized easily by the object deputy model. 
We discuss several advanced database applications: geographic databases 
, virtual office systems and distance education systems. (37 Refs) 
Subfile: C 

Descriptors: data encapsulation; database theory; deductive databases 
; distributed databases ; inheritance; object-oriented databases 

Identifiers: object deputy model; powerful flexible object-oriented 
databases ; information redundancy removal ; class hierarchy; flexible 
views; high-performance systems; object structure .flexibility; 
encapsulation; deputy objects; deputy classes; switching operations; 
dynamic inheritance structures; object roles; object migration; multiple 
inheritance; data-knowledge coordination model; distributed databases ; 
knowledge bases; geographic databases ; virtual office systems; distance 
education systems 

Class Codes: C6160J (Object-oriented databases); C6120 (File 
organisation) ; C4250 (Database theory) 
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Title: The field matching problem: algorithms and applications 
Author (s): Monge, A.E.; Elkan, CP. 

Author Affiliation: Dept. of Comput. Sci . & Eng., California Univ., San 
Diego, La Jolla, CA, USA 

Conference Title: KDD-96 Proceedings. Second International Conference on 
Knowledge Discovery and Data Mining p. 267-70 

Editor (s): Simoudis, E.; Han, J.; Fayyad, U. 

Publisher: AMI Press, Menlo Park, CA, USA 

Publication Date: 1996 Country of Publication: USA xiv+391 pp. 
Material Identity Number: XX96-02511 

Conference Title: Proceedings of 2nd International Conference on 
Knowledge Discovery and Data Mining (KDD-96) 

Conference Date: 2-4 Aug. 1996 Conference Location: Portland, OR, USA 
Language: English Document Type: Conference Paper (PA) 
Treatment: Practical (P) 

Abstract: To combine information from heterogeneous sources, equivalent 
data in the multiple sources must be identified . This task is the 
field matching problem. Specifically, the task is to determine whether or 
not two syntactic values are alternative designations of the same semantic 



entity. For example, addresses "Dept. of Compu^^ Sci . and Eng., 

University of California, San Diego, 9500 Gilman Dr. Dept. 0114, La Jolla, 
CA 92093" and "UCSD, Computer Science and Engineering Department, CA 
92093-0114" designate the same department. This paper describes three field 
matching algorithms, and evaluates their performance on real-world 
datasets. One proposed method is the well-known Smith-Waterman (1981) 
algorithm for comparing DNA and protein sequences. Several applications of 
field matching in knowledge discovery are described briefly, including 
WEBFIND, which is a new software tool that discovers scientific papers 
published on the World Wide Web (WWW) . WEBFIND uses external information 
sources to guide its search for authors and papers. Like many other WWW 
tools, WEBFIND needs to solve the field matching problem in order to 
navigate between information sources. (12 Refs) 
Subfile: C 

Descriptors: bibliographic systems; deductive databases ; knowledge 
acquisition; merging; online front-ends; software performance evaluation; 
string matching 

Identifiers: field matching problem; heterogeneous information sources; 
information combination; equivalent data identification; syntactic values; 
alternative designations; semantic entity; performance evaluation; DNA 
sequences; protein sequences; knowledge discovery; WEBFIND; scientific 
papers; World Wide Web; external information sources; author search; 
information navigation; addresses 

Class Codes: C6160K (Deductive, databases); C6170T (Knowledge engineering 
tools); C7250N (Front end systems for online searching); C7250C ( 
Bibliographic retrieval systems) 
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Title: Should journal references be standardized? 

Author ( s ) : Hudnut , S . K . 

Conference Title: 12th National Online Meeting. Proceedings 1991 p. 
149-55 

Editor (s) : Williams, M.E. 

Publisher: Learned Inf, Medford, NJ, USA 

Publication Date: 1991 Country of Publication: USA xii+458 pp. 

ISBN: 0 938734 51 2 

Conference Sponsor: Learned Inf 

Conference Date: 7-9 May 1991 Conference Location: New York, NY, USA 
Language: English Document Type: Conference Paper (PA) 
Treatment: General, Review (G) ; Practical (P) 

Abstract: References to journal articles are a very large part of online 
searching. Yet identifying which online records are duplicates of 
each other isn't always easy The handling of titles, authors, journal names 
and source information across databases can be different enough to 
require close scrutiny before determining if a group of records refers 
to the same article. Using duplicate detection or scanning journal name 
indexes points up the many variations that can exist between databases . 
Is standardization the answer? If so, by whom? The paper examines the 
issues from the perspective of the three participants: the database 
producer, the online service and the user, and considers the impact of the 
existing problems on online searching. (0 Refs) 

Subfile: C 

Descriptors: information services; standardisation 

Identifiers: journal references; online searching; standardization; 
database producer; user 
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Title: Detecting duplicates: a searcher's dream come true 

Author(s): Miller, C. 

Author Affiliation: Carmen Miller Resources, Seattle, WA, USA 
Journal: Online vol.14, no. 4 p. 27-34 
Publication Date: July 1990 Country of Publication: USA 
CODEN: ONLIDN ISSN: 0146-5422 

Language: English Document Type: Journal Paper (JP) 
Treatment: Practical (P) 

Abstract: DIALOG'S announcement of a duplicate record detection 
feature in 1989 was met with enthusiasm. Searchers have been requesting a 
way to identify duplicate records since long before the 1987 

introduction of OneSearch DIALOG'S multiple-file search capability. Once 
one begins to search multiple databases with overlapping sources, the 
need for a way to sort merged output, identify the duplicates and 
retain the records one chooses becomes imperative. Duplicate detection 
is a long-awaited, major system enhancement which allows searchers to do 
all of these things. Duplicate detection was introduced in approximately 
200 of DIALOG'S bibliographic files and there are plans to add 60 to 70 
more of the full-text newspaper, newswire, and journal article files. The 
author explains how she tested the DIALOG feature, how it works and its 
performance. (0 Refs) 

Subfile: C 

Descriptors: bibliographic systems; information services 
Identifiers: duplicate record detection feature; OneSearch; 

multiple-file search capability; multiple databases ; overlapping sources; 

merged output; system enhancement; searchers; bibliographic files; 

full-text newspaper; newswire; journal article files; DIALOG feature 
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Title: Automatic merging of monographic data bases - identification 
of duplicate records in multiple files : the IUCS scheme 
Author (s): Williams, M.E.; MacLaury, K.D. 

Author Affiliation: Information Retrieval Res. Lab., Univ. of Illinois, 
Urbana, IL, USA 

Journal: Journal of Library Automation vol.12, no. 2 p. 156-68 
Publication Date: June 1979 Country of Publication: USA 
CODEN: JLAUAY ISSN: 0022-2240 

Language: English Document Type: Journal Paper (JP) 
Treatment: Practical (P) 

Abstract: A research project entitled 'A State-wide Catalog Feasibility 
Study' was funded by the Illinois State Library and carried out within the 
Information Retrieval Research Laboratory (IRRL) of the Coordinated Science 
Laboratory (CSL) at the University of Illinois in Urbana-Champaign . The 
purpose of the project was to develop a machine algorithm for locating and 
pairing duplicate records in machine-readable bibliographic files in order 
to eliminate duplicates and permit files from different libraries to be 
used for creation of a union catalog of machine-readable bibliographic 
records. A prototype system called IUCS was developed and tested on sample 
files from OCLC, Northwestern University, and the University of Chicago. ( 
1 Refs) 

Subfile: C 

Descriptors: library mechanisation 

Identifiers: monographic data bases ; duplicate records; multiple 
files; IUCS scheme; Information Retrieval Research Laboratory; Coordinated 
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OCLC introduces SiteSearch 4 . 1 

Information Today , November 1, 1999 , vl6 nlO p57-58, 2 Page(s) 

ISSN: 8755-6286 

Company Name : OCLC 

URL: http://www.oclc.org 

Product Name: SiteSearch 4.1 

Languages: English 

Document Type: Product Announcement 
Hardware/Software Compatibility: IBM PC Compatible 
Geographic Location: United States 

Announces the release of version 4.1 of the OCLC SiteSearch ($NA) suite 
of software from OCLC of Dublin, OH (614) . Reports that it provides a 
comprehensive solution for managing distributed library information 
resources in a Web environment. Says that it offers a toolkit that lets 
libraries integrate their electronic resources under one Web interface, 
provide flexible access to those resources, and build unique local 

databases . Notes that it includes the following features: Web-based 
record creation, vocabulary-assisted searching, merged results sets, 

duplicate record detection , more starter interface options, and 

online updates. Says that it uses Extensive Markup Language (XML) and 
Resource Description Framework (RDF) within its new record-building 
component. Adds that the Database Builder component uses RDF and XML as 
the underpinnings of a Java-based toolkit. (KMH) 

Descriptors: Library Automation; XML; Java; Library; Database ; 

Interface; Records Management 
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Low-cost Double Take keeps data honest 

Hollander, Geoffrey 

InfoWorld , July 12, 1999 , v21 n28 p56, 1 Page(s) 
ISSN: 0199-6649 

Company Name: PeopleSmith Software 
URL: http://www.peoplesmith.com 
Product Name: Double Take 4.1 
Languages : English 
Document Type: Software Review 
Grade (of Product Reviewed) : B 

Hardware/Software Compatibility: IBM PC Compatible 
Geographic Location: United States 

Presents a favorable review of Double Take 4.1 ($3,000), a database 
cleaning and conditioning utility program from PeopleSmith Software of 
Scituate, MA (800) . Runs on IBM PC Compatibles with Microsoft Windows NT, 
Windows 98 and Windows 95. Says that it is capable of handling a variety of 
files, file types, and sizes. Cites features such as an array of 
problem-solving alternatives, rapid processing time, and flexible output 
options. Reports, however, that it takes time for the user to master it and 
it does not work with SQL Server 7. Concludes that it is a good solution 
for finding and removing duplicate records or to merge unique 

records from disparate databases . Received a rating of four on a scale of 
one to five. Includes one product summary. (MEM) 

Descriptors: Utility Program; Database Management; Enterprise 
Computing; File Management; Upgrade 
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LIFE CODE A Deployed Application for Automated Medical Coding 

Heinze, Daniel T; Morsch, Mark L; Sheffer, Ronald E 
AI Magazine v. 22 no2 (Summer 2001) p. 76-88 

DOCUMENT TYPE: Feature Article ISSN: 0738-4602 LANGUAGE: English 
RECORD STATUS: Corrected or revised record 

ABSTRACT: LIFECODE is a natural language processing (NLP) and expert 
system that extracts demographic and clinical information from free-text 
clinical records. The initial application of LIFECODE is for the emergency 
medicine clinical specialty. An application for diagnostic radiology went 
into production in October 2000. The LIFECODE NLP engine uses a large 
number of specialist readers whose particular output are combined at 
various levels to form an integrated picture of the patient's medical 
condition (s ) , course of treatment, and disposition. The LIFECODE expert 
system performs the tasks of combining complementary information , 
deleting redundant information , assessing the level of medical risk 
and level of service represented in the clinical record, and producing an 
output that is appropriate for input to an electronic medical record (EMR) 
system or a hospital information system. Because of the critical nature of 
the tasks, LIFECODE has a unique "self-awareness" feature that enables it 
to recognize the limits of its competence and, thus, ask for assistance 
from a human expert when faced with information that is beyond the bounds 
of its competence. The LIFECODE NLP and expert systems reside in various 
delivery packages, including online transaction processing, a web browser 
interface, and an automated speech recognition (ASR) interface. Reprinted 
by permission of the publisher. 

DESCRIPTORS: Natural language processing; Medical databases ; 
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A knowledge-based approach for duplicate elimination in data cleaning. 

Author(s): Low, Wai Lup; Lee, Mong Li; Ling, Tok Wang 

Corporate Source: National University, Singapore 117543 ; National 

University, Singapore 117543 ; National University, Singapore 117543 
Information Systems vol. 26, no. 8, pages 585-606 
Publication Date: December 2001 
ISSN: 0306-4379 

Publisher URL: http://www.elsevier.nl 

Language: English 

Document Type: Journal Article 

Record Type: Abstract 

Journal Announcement: 3701 

Existing duplicate elimination methods for data-cleaning work on the 
basis of computing the degree of similarity between nearby records in a 
sorted database . High recall can be achieved by accepting records with 
low degrees of similarity as duplicates, at the cost of lower precision. 
High precision can be achieved analogously at the cost of lower recall. 
This is the recall-precision dilemma. Develops a generic knowledge-based 
framework for effective data cleaning that can implement any existing data 
cleaning strategies and more. Proposes a new method for computing 
transitive closure under uncertainty for dealing with the merging of 
groups of inexact duplicate records , and explains why small changes to 
window sizes has little effect on the results of the sorted neighborhood 
method. Experimental results with two real-world datasets show that this 
approach can accurately identify duplicates and anomalies with high recall 
and precision, thus effectively resolving the recall-precision dilemma. 



Descriptors: Inf ormatio^^retrieval ; Precision; Recall; ^^^>licate 
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Duplicate detection and record consolidation in large bibliographic 
databases : the COPAC database experience. 

Author(s): Cousins, Shirley Anne 

Journal of Information Science vol. 24, no. 4, pages 231-240 

Publication Date: June 1998 

ISSN: 1352-7460 

Language: English 

Document Type: Journal Article 

Record Type: Abstract 

Journal Announcement: 3309 

Presents a case study of COPAC, a union catalog with access to the online 
catalog records of some of the largest academic research libraries in the 
U.K. and Ireland. Points out that COPAC, like all union catalogs, is 
supplied with multiple copies of records representing the same document in 
the contributing library catalogs, and that it is neccessary to develop 
duplicate detection and record consolidation procedures in order to 
reduce the level of duplication visible to the COPAC user. Focuses on 
measures taken to carry out the duplicate detection and record 
consolidation procedures, and discusses problem areas encountered. 
Describes the general structure of these procedures, thus providing a model 
of the duplicate record handling mechanisms used in COPAC. 

Descriptors: Cataloging; Online catalogs; Union catalogs 
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Method of merging large databases in parallel. 

Author(s): Hernandez, M A; Stolfo, S.J. 
Patent Number(s) : US 5497486 
Publication Date: Mar 5, 1996 
Language: English 
Document Type: Patent 
Record Type: Abstract 
Journal Announcement: 3100 

The semantic integration problem for merging multiple databases of very 
large size, the merge/purge problem, can be solved by multiple runs of the 
sorted neighborhood method or the clustering method with small windows, 
followed by the computation of the transitive closure over the results of 
each run. The sorted neighborhood method works well under this scheme but 
is computationally expensive due to the sorting phase. An alternative 
method is introduced, based on data clustering that reduces the complexity 
to linear time making multiple runs followed by transitive closure feasible 
and efficient. A method is provided for identifying duplicate records 
in a database , each record having at least one field and a plurality of 
keys, including the steps of sorting the records according to a criteria 
applied to a first key; comparing a number of consecutive sorted records to 
each other, wherein the number is less than a number of records in said 



database and identifyi^B a first group of duplicate^Mrecords ; 

storing the identity of t7Te first group; sorting the recorcre according to a 
criteria applied to a second key; comparing a number of consecutive sorted 
records to each other; and subjecting the union of the first and second 
groups to transitive closure. 

Descriptors: Databases ; Integration; Parallel processing; Semantics 
Classification Codes and Description: 5.04 (Advanced Computing, Parallel 

Processing); 6.02 (Bibliographic Search Services, Databases ) 
Main Heading: Information Processing and Control; Information Systems and 

Applications 
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Abstract (Basic) : US 20030046280 Al 

NOVELTY - The method involves identifying two records. A record 
matching criteria is then applied to compare the data in the two 
identified records to determine a common data. The data common in both 
the records are merged together into a composite record in response to 
the determined common data. 

USE - Used for consolidating multiple records in various 
organizations and enterprises e.g. health care field. 

ADVANTAGE - The redundant and duplicate data are identified 
effectively and consolidated into a single entity and stored ones 
repository. This method allows the space to be increased in the memory 
by deleting all the unwanted data stored. The record about a patient is 
maintained as unique one if the patient has one or more records, 
thereby limiting the problem of receiving the wrong treatment. 

DESCRIPTION OF DRAWING ( S ) - The drawing shows a system including an 
application for preventing the creation of duplicate records and for 
consolidating multiple records that are associated with a single entity 
and managing the associated workload. 

Record manager application (200) 

Duplicate record report process (205) 
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Clinical data repository. (217) 
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Abstract: LIFECODE, is a natural language processing (NLP) and expert 
system that extracts demographic and clinical information from 
free-text clinical records. The initial application of LIFECODE. is 
for the emergency medicine clinical specialty. An application for 
diagnostic radiology went into production in October 2000. The LIFECODE 
NLP engine uses a large number of specialist readers whose particular 
output are combined at various levels to form an integrated picture of 
the patient ! s medical condition ( s ) , course of treatment, and 
disposition. The LIFECODE expert system performs the tasks of 
combining complementary information , deleting redundant 
information , assessing the level of medical risk and level of 
service represented in the clinical record, and producing an output 
that is appropriate for input to an electronic medical record ( 
EMR ) system or a hospital information system. Because of the 
critical nature of the tasks, LIFECODE has a unique "self-awareness" 
feature that enables it to recognize the limits ' of its competence and, 
thus, ask for assistance from a human expert when faced with 
information that is beyond the bounds of its competence. The LIFECODE 
NLP and expert systems reside in various delivery packages, including 
online transaction processing, a web browser interface, and an 
automated speech recognition (ASR) interface. 
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Abstract (Basic) : US 20030046280 Al 

NOVELTY - The method involves identifying two records . A record 
matching criteria is then applied to compare the data in the two 
identified records to determine a common data . The data 
common in both the records are merged together into a composite 
record in response to the determined common data. 

USE - Used for consolidating multiple records in various 
organizations and enterprises e.g. health care field. 

ADVANTAGE - The redundant and duplicate data are identified 
effectively and consolidated into a single entity and stored ones 
repository. This method allows the space to be increased in the memory 
by deleting all the unwanted data stored. The record about a patient 
is maintained as unique one if the patient has one or more records, 
thereby limiting the problem of receiving the wrong treatment. 

DESCRIPTION OF DRAWING (S) - The drawing shows a system including an 
application for preventing the creation of duplicate records and 
for consolidating multiple records that are associated with a 
single entity and managing the associated workload. 

Record manager application (200) 

Duplicate record report process (205) 

Web browser (210) 

Electronic master person index (213) 

Clinical data repository. (217) 
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Abstract (Basic) : WO 200065522 A2 

NOVELTY - The system has display programs (204a, 204b) to display 
registered medical records. The medical service provider databases 
(200) store electronic medical records having information indicating 
histories of patients associated with the respective service 
provider. A registry repository includes a main registry database (104) 
with registered electronic medical records. 

. DETAILED DESCRIPTION - The system includes a match / merge 
program (206) which duplicates the electronic medical records 
associated with the medical service provider databases (200) , based 
on unique patient information, to form the registered medical 
record of complete patient history. The program assigns a unique 
patient identifier to each patient and assigns the same identifier 
to each medical service provider database record stored in the main 
registry database (104). A standard data exchange format is used 
between the main registry database and the service provider databases 
to transfer medical records in a standardized form. An INDEPENDENT 
CLAIM is also included for method of storing and accessing electronic 
medical records. 

USE - For transferring electronic medical records from 
heterogeneous databases through computers across local networks and 
internet, into a central registry database to merge the records for 
presentation . 

ADVANTAGE - Allows all service care providers to become part of a 
network that is updated with immunization information from both private 
and public databases which are willing to share electronic medical 
records. The electronic records are processed through a match / merge 
algorithm, to identify the multiple sourced records belonging to a 
single patient , and provide a completed patient history at the 
point of service. 

DESCRIPTION OF DRAWING (S) - The figure shows the block diagram 
illustrating the basic architecture for the flow of information and the 
interfaces to the main registry database. 

Databases (104, 200) 

Display programs (204a, 204b) 

Match/merge program (206) 



; Medical ; record; register; system; 



pp; 28 DwgNo 2/l\ 

Title Terms: ELECTRONIC; T1EDICAL ; RECORD; REGISTER; SYSTT5R; DISPLAY; 

REGISTER; ELECTRONIC; MEDICAL ; RECORD; MAIN; REGISTER; DATABASE; 

COLLECT; STORAGE; INFORMATION; PATIENT ; SERVICE 
Derwent Class: T01 

International Patent Class (Main) : G06F-019/00 
File Segment: EPI 



?tr S13/5/1, 6,7 




13/5/1 (Item 1 from file: 350) 

DIALOG (R) File 350:Derwent WPIX 

(c) 2004 Thomson Derwent . All rts . reserv. 

015770436 **Image available** 

WPI Acc No: 2003-832638/200377 

XRPX Acc No: N03-665593 

Record classification method, for records for delivery to different 
destinations, involves forming record group and assigning destination 
identification to each replica record except one record in that 
group 

Patent Assignee: IBM CANADA LTD (IBMC ); COCHRANE R J (COCH-I); LAPIS G 
(LAPI-I); PIRAHESH H (PIRA-I); RIELAU S (RIEL-I); SIDLE R S (SIDL-I); ZHU 
B (ZHUB-I) 

Inventor: COCHRANE R J; LAPIS G; PIRAHESH H; RIELAU S P; SIDLE R S; ZHU B; 
RIELAU S 

Number of Countries: 002 Number of Patents: 002 
Patent Family: 

Patent No Kind Date Applicat No Kind Date Week 

US 20030204540 Al 20031030 US 2003393852 A 20030321 200377 B 

CA 2384174 Al 20031029 CA 2384174 A 20020429 200378 

Priority Applications (No Type Date) : CA 2384174 A 20020429 
Patent Details: 

Patent No Kind Lan Pg Main IPC Filing Notes 
US 20030204540 Al 19 G06F-012/00 
CA 2384174 Al E G06F-017/30 

Abstract (Basic) : US 20030204540 Al 

NOVELTY - A group of replica records is formed for a record 
among selected subset of records. A destination identification is 
assigned for each replica record in the group except one replica 

record . A replica record associated with one of several 
destinations, is selected and placed in that destination. 

DETAILED DESCRIPTION - INDEPENDENT CLAIMS are also included for the 
following : 

(1) computer readable medium storing instructions for selecting 
subset of records; 

(2) data processing system. 

USE - For classifying records for delivery to different targets or 
destinations. 

ADVANTAGE - Records of information are classified for delivery to 
different targets or destinations reliably, even though it is not 
predictable as to which target, a given unit of data is to be routed. 

DESCRIPTION OF DRAWING (S) - The figure shows a block diagram of the 
record classification system. 

records (102) 

database management system (106) 

computer system (116) 

memory (118) 

network computer (128) 
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Dynamic and hybrid sparse indices generating method for workfiles used in 

SQL queries in relational database management system 
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Abstract (Basic) : US 5978792 A 

NOVELTY - A sparse index stores entries containing key value that 
indicates approximate location of data records. Sparse index is probed 
to determine starting location of next search of key value in the 
query. An entry is returned from the sparse index without performing 
next search, when it matches with the key value, otherwise next search 
is performed. 

DETAILED DESCRIPTION - The sparse index comprises a fixed number of 
key values along with their record identifiers (RIDs) (410) in a 
file . Duplicate entries are eliminated during the merge phase of 
file creation. A dynamic portion (404) is created for sparse index 
during the search phase of file (406) access. The exact location of the 
data record in the file, is determined using an auxiliary search and 
the approximate location provided by the sparse index. An INDEPENDENT 
CLAIM is also included for query performing apparatus. 

USE - For RDBMS using dynamic and hybrid sparse index, in 
mainframe, minicomputer and personal computers. 

ADVANTAGE - The performance of workfiles used in SQL queries is 
optimized by searching data record using sparse index. 

DESCRIPTION OF DRAWING (S) - The figure depicts the block diagram 
illustrating hybrid sparse index comprising static and dynamic portion. 

Dynamic portion (404) 
File (406) 

Record identifiers (410) 
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Abstract (Basic) : JP 9319635 A 

The method involves sequentially receiving updating indication and 
updating record for a record registered into a database. A combining 
process is performed to the updating record which has the same 
identification - After the combining process, the combined updating 
record is used to update the database record. 

ADVANTAGE - Facilitates batch processing and updating process of 
database record. Effectively utilises memory which stores updating 
record, by erasing unnecessary old record at every combining process 
execution. Reduces load of batch processing by reducing number of batch 
records . Enables optimum implementation of whole client and server type 
database system. 
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A] 



PROBLEM TO 
accuracy. 



BE SOLVED : 



ABSTRACT 

To predict information use frequency at high 



SOLUTION: In a program, a comparison part where a second time-series 
data D20 concerning the frequency of use of 'new year postcards 1 as a 
separate keyword with a first time-series data D10 concerning the frequency 
of use of 'camera 1 as the keyword inputted in a keyword search of the 
Internet as a reference is sequentially shifted in units of specified time 
on a time axis and the relative coefficient of the first time-series data 
D10 and the second time-series data D20 for each specified time unit is 
calculated and a prediction part specifying a group of the first 
time-series data D10 and the second time- series data D20 
corresponding to the relative coefficient with the greatest value among the 
multiple number of relative coefficients calculated by the comparison part 
and predicting the frequency of use of the first time-series data D10 
constituting the group based on the second time-series data D20 are 
provided . 
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ABSTRACT 

PROBLEM TO BE SOLVED: To suppress the capacity of an index file by 
compressing index information more efficiently and to enable high-speed 
retrieval using the index file. 

SOLUTION: When entries of index information are successively identical , 
this information compressing method deletes by an index information 
compression part 107 the entries except only one entry, puts (n) successive 
pieces of index information with the same document number in one group 



as to index informal^Bn in the same character ch^B, and generates 
compressed index information having the document number compressed into 
one. When byte data of position numbers are not larger than a reference 
value ( O× Of ) , the low-order four bits of each of the byte data of the 
position numbers are connected to generate compressed index information 
having the position number converted into 1-byte data. 
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ABSTRACT 

PROBLEM TO BE SOLVED: To provide the data compression method and the data 
compressor by which data can be compressed at a high speed without taking 
mush time for data retrieval by adopting the hash method for data 
compression by the LZ slide dictionary method so as to retrieve data. 

SOLUTION: When a data input output device 1 reads compression processing 
object data to a memory 3, a control circuit 2 retrieves the data string 
the same as compression processing object data string from a hash table 
by the hash method and compares the data of the retrieved data string with 
that of the compression processing object data string one by one and 
provides an output of equal data length of both the data strings, 
repeats the retrieval and comparison till the retrieval object is not 
in existence and selects a data string whose equal data length is longest 
among data strings at the retrieved hand position and succeeding positions 
and provides an output of the head position of the selected data string and 
the equal data length as equality information and registers the head 
position of the compression processing object data string to a hash table 
as a retrieval object. 



14/5/9 (Item 1 from file: 350) 

DIALOG (R) File 350:Derwent WPIX 

(c) 2004 Thomson Derwent. All rts. reserv. 

015887469 **Image available** 
WPI Acc No: 2004-045304/200405 
XRPX Acc No: N04-036920 

Multimedia data fragmenting method for Internet, involves identifying 

pair of nodes which minimizes parameter proportional to data common to 
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Abstract (Basic) : EP 1367831 A2 

NOVELTY - The multimedia data (D'6-D'9) associated with nodes 
(N'6-N'9) are grouped into a temporary cell. When the temporary cell 
reaches a threshold size, a pair of nodes (N'7,N'8) which minimizes a 
parameter proportional to number of data common to associated data 

(D ! 7,D'8) is identified , for forming a definitive cells (C5-C7] 
comprising data (D'5-D'7). 

DETAILED DESCRIPTION - An INDEPENDENT CLAIM is also included for 
multimedia data fragmenting device. 

USE - For fragmenting multimedia data associated with navigation 
nodes in telecommunication network such as Internet. 

ADVANTAGE - Optimizes use of memory capacity and computational 
capacity, by fragmenting multimedia data and grouping identical 
or redundant data into same cell. 

DESCRIPTION OF DRAWING (S) - The figure shows a schematic view of 
cell comprising nodes associated with multimedia data. 

definitive cells (C"5-C f 7) 

multimedia data (D'5-D'9) 

nodes (N'6-N'9) 

pp; 9 DwgNo 2/2 

Title Terms: DATA; FRAGMENT; METHOD; IDENTIFY; PAIR; NODE; MINIMISE; 

PARAMETER; PROPORTION; DATA; COMMON; ASSOCIATE; DATA; FORMING; CELL 
Derwent Class: T01 

International Patent Class (Main) : G06F-007/00 ; G06F-017/30 ; 

H04N-007/24 
File Segment: EPI 



14/5/13 (Item 5 from file: 350) 

DIALOG (R) File 350: Derwent WPIX 

(c) 2004 Thomson Derwent. All rts . reserv. 

014604513 **Image available** 

WPI Acc No: 2002-425217/200245 

Related WPI Acc No: 2002-626900 

XRPX Acc No: N02-334351 

Data replication system for employee management system, matches 
data items of several master tables and replicates matched data items 
to generate replica table using particular data in master table 
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Abstract (Basic) : US 200| 

NOVELTY - A matchTtTig unit matches data items of a Hftmber of master 
tables (108,109) using particular data in the master tables as a 
joining key to join each other. A replicating unit replicates the 
matched data items based on the joining key to generate one 
replica table (121) . 

DETAILED DESCRIPTION ■ An INDEPENDENT CLAIM is also included for 
data replication program. 

USE - For replicating data of several master tables of database in 
employee management system. 

ADVANTAGE - The data of number of master tables is replicated in 
one replica table, and hence job execution time is minimized. 

DESCRIPTION OF DRAWING (S) - The figure shows the block diagram of 
data replica system. 

Master tables (108,109) 

Replica table (121) 
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Distributed database system for distributed system, has addition table 
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which records of duplication database are added/ deleted 
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Abstract (Basic) : JP 2002007441 A 

NOVELTY - An addition table (35) and a deletion table (34) 
containing updating information of duplication database (11,12) and 
deletion information of the database respectively, are compressed into 
a compression file (36) . The records in each database corresponding to 
the tables, are added or deleted synchronously using the compression 
file. 

USE - Distributed database system for use in distributed system. 

ADVANTAGE - The records are added/deleted synchronously within 
short time, thus high speed synchronization process is achieved. 

DESCRIPTION OF DRAWING (S) - The figure shows the block diagram of 
distributed database system. (Drawing includes non-English language 
text) . 

Duplication database (11,12) 
Deletion table (34) 
Addition table (35) 
Compression file (36) 
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Abstract (Basic) : US 20010007989 Al 

NOVELTY - The system works by integrating multiple knowledge base 
files into single files, keeping all of the parameters and constraints 
of these multiple files by identifying two or more common 
parameters among the files . These parameters are then merged into a 
single statement in the integrated file, the single statement listing 
all of the values of the several statements in the multiple files. 
Additional constraints are added to the new, smaller file to properly 
relate the values to a characteristic of the original files. Combining 
the value and activity constraints into a single constraint statement 
optimizes the resulting single file. 

DETAILED DESCRIPTION - An INDEPENDENT CLAIM is included for a 
software engine for integrating a first and a second knowledge base 
file into a single smaller integrated third file. 

USE - For generating, merging, and optimizing Knowledge Bases. 

ADVANTAGE - The method allows the user to go through the selection 
and configuration process using the bottom-up approach, speeding the 
process up. 

DESCRIPTION OF DRAWING (S) - The figure shown is a flow chart 
illustrating a knowledge base merging and optimization process, 
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Document display method for newspaper report, received via Internet, 
involves comparing similarity between chosen document and original report 
and displaying combined common data and original information 
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Abstract (Basic) : JP 2000105769 A 

NOVELTY - The similarity between multiple reports is evaluated and 
then documents are divided into multiple portions . The similarity 
between each divided portion is evaluated and one document is chosen. 
The information common to chosen document and original report is 
detected . The common information and original information are 
combined and displayed. 

USE - For newspaper report, magazine report received through 
Internet . 

ADVANTAGE - Enables detection and display of original document, 
thereby enabling efficient perusal of report by user, 
pp; 26 DwgNo 1/24 
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Multilevel source data structure compression method used in field of 
document management in general purpose computer system 
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Abstract (Basic) : US 593^B2 A 




NOVELTY - The reference in the constituent data structure at the 
next higher level is replaced with a reference that refers to a 
constituent data structure stored in a destination data structure. The 
matching constituent data structures are identified. The links to the 
deleted constituent data structures are then replaced with links to the 
selected constituent data structure. 

DETAILED DESCRIPTION - The data structure at the next higher level 
refers to a constituent data structure at the selected level of the 
source data structure. Each constituent data structure has specific 
contents. All of the constituent data structures having the same 
contents are identified as matching . 

USE - For compressing multilevel source data structure for document 
management in general purpose computer system. 

ADVANTAGE - Since a reference is inserted in the compressed 
document set, that refers to the matching constituent data structure 
already contained in the compressed document set, avoids existence of 

duplicate constituent data structure in the compressed document 
set . 

DESCRIPTION OF DRAWING (S) - The figure shows the flow diagram of 
compress documents routine, 
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Title Terms: MULTILEVEL; SOURCE; DATA; STRUCTURE; COMPRESS; METHOD; FIELD; 

DOCUMENT; MANAGEMENT; GENERAL; PURPOSE; COMPUTER; SYSTEM 
Derwent Class: T01 

International Patent Class (Main) : G06F-017/30 
File Segment: EPI 



14/5/28 (Item 20 from file: 350) 

DIALOG (R) File 350: Derwent WPIX 

(c) 2004 Thomson Derwent. All rts . reserv. 

012001830 **Irnage available** 

WPI Acc No: 1998-418740/199836 

XRPX Acc No: N98-326470 

Data processing system for printing of goods catalogue - searches grouped 
object data corresponding to input search data and outputs searched 
object data by output device 

Patent Assignee: TOPPAN PRINTING CO LTD (TOPP ) 

Number of Countries: 001 Number of Patents: 002 

Patent Family: 

Patent No Kind Date Applicat No Kind Date Week 

JP 10171816 A 19980626 JP 96342704 A 19961206 199836 B 

JP 3424473 B2 20030707 JP 96342704 A 19961206 200345 

Priority Applications (No Type Date) : JP 96342704 A 19961206 
Patent Details: 

Patent No Kind Lan Pg Main IPC Filing Notes 
JP 10171816 A 7 G06F-017/30 

JP 3424473 B2 7 G06F-017/30 Previous Publ . patent JP 10171816 

Abstract (Basic) : JP 10171816 A 

The system stores several object data. These object data are 
divided into multiple groups , based on a preset rule and division 
information of each object data. The object data with which division is 
performed, is compared with an identification data . If they are 
identical then the matched data are stored. 

Input search data corresponding to attribute of each object data 
that are matching with identification data, are stored separately. 
Then, object data corresponding to input search data belonging to 
respecting group are searched. These searched data are then output, by 
an output device (3) . 

ADVANTAGE - Reads out object data, efficiently. 
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Abstract (Basic) : JP 10097544 A 

The system includes an enquiry process server, an I/O server and an 
accumulation server connected through a network. The record read by the 
I/O server is allocated to accumulation process server by a record 
allocation unit. The fields of a record are divided into groups and a 
group identifier is assigned to each group . Several records with 
same group identifier values are further grouped together. 

The accumulation of the records is based on the group identifier 
value. The storing area of the accumulated records is decided, also 
based on group identifier value. The accumulation result is 
periodically updated. Intermediate accumulation results are notified to 
the enquiry process server. The enquiry process server unifies the 
various intermediate accumulation results . 

ADVANTAGE - Realizes efficient database processing. 
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Data compressing method for data processing system - involves lexically 
sorting group of N cells so that lexical order of N cells is established 
based upon lexical value of substring in each of N cells 
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Abstract (Basic) : US 5659755 A 

The method involves selecting a block of N bytes in the data 
processing system, wherein N is greater than 1. A group of N data cells 
are created, each of the cells has a physical tag, P, for indicating a 
physical order in the group. The group of N cells are lexically sorted 
so that a lexical order of the N cells is established based upon the 
lexical value of the substring in each of the N cells. Matching strings 
are identified in the substrings of selected ones of the N cells in the 
lexically sorted group of N cells. The block of N bytes are encoded 
utilising the identified matching strings to produce a compressed data 
set. The compressed data set is efficiently produced by identifying 
redundant information in the lexically sorted group of N cells. 

ADVANTAGE - Quickly and efficiently compress block of data while 
maximising amount data processing that may be performed in parallel. 
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Abstract (Basic) : WO 9710537 A 

The method for identifying textual documents and multimedia files 
involves storing a number of document and multimedia records each of 
which represent a document or multimedia file. The document records 
have associated text information fields, each of which represents text 
from one of the textual documents, and the multimedia records have 
multimedia information fields representing only digital video or audio 
information and associated text fields, each representing text 
associated with one of the multimedia information fields. 

A single search query corresponding to the search topic is received 
pref in a natural language format, and an index database is searched in 
accordance with the single search query to simultaneously identify 
document records and multimedia records related to the single search 
query. A search result list having entries representing both textual 
documents and multimedia files related to the single search query is 
generated in accordance with the document records and the multimedia 
records identified by the index database search. Text or digital video 
or audio information corresponding to the search topic is retrieved by 
selecting entries from the search result list. 

USE - Automated multi-user system for identifying and retrieving 
text and multi-media files from various publisher sources. 

ADVANTAGE - Enables searching and retrieval of library or database 
to identify text documents and multimedia files relevant to query. 
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Abstract (Basic) : EP 508519 A 

The bibliometric storage system accesses documents (20, 22, 24) 
containing a number of postings or elements such as noun, adverbs and 
the like. Each posting has a document identifier and an offset. The 
postings are in a linear order such as decreasing frequency of 
occurrence . 

The information is then further compressed using pointers and 
tables to remove redundant information . This compression makes 
use of direct random addressibility to reduce storage and stores all 
the postings for a particular document together. 

ADVANTAGE - Allows easy expandability and updating while requiring 
roughly halve storage used by conventional systems. 
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PRODUCT NAME: iBaan for CRM (165018) 

SSA Global Technologies Inc (241032) 
500 W Madison #1600 
Chicago, IL 60661 United States 
TELEPHONE: (312) 258-6000 

RECORD TYPE: Directory 

CONTACT: Sales Department 

Baan ' s iBaan for CRM is a customer relationship management (CRM) system 
that allows businesses to identify client requirements, improve order 
accuracy, and reduce service costs. The system consolidates information 
collected from multiple customer touchpoints, eliminating redundant 
data entry. iBaan for CRM also identifies profitable customers, allowing 
companies to prioritize and target marketing and sales activities . The 
system collects sales result, customer profile, qualified lead to completed 
sale ratio, and other information. iBaan for CRM provides companies with 
opportunity management, sales forecasting, automated quoting, automated 
proposal generation, and remote data synchronization features. The system 
supports online marketing and offline fulfillment operations. 

DESCRIPTORS: CRM; Customer Service; Electronic Customer Service; Marketing 
Information 

HARDWARE: Hardware Independent 
OPERATING SYSTEM: Open Systems 
PROGRAM LANGUAGES: Not Available 
TYPE OF PRODUCT: Micro 
POTENTIAL USERS: Cross Industry 
PRICE: Available upon request 

REVISION DATE: 20030915 
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PRODUCT NAME: Merge/Purge Plus (058521) 

Group 1 Software Inc (400319) 
4200 Parliament PI 

Lanham, MD 20706-1844 United States 
TELEPHONE: (301) 731-2300 

RECORD TYPE: Directory 

CONTACT: Sales Department 

Merge/Purge Plus, a product from Group 1 Software, detects duplicates in 
mailing lists. There are significant production costs involved in direct 
mail, and duplication can be costly, both in terms of wasted postage and 
wasted production units. Merge /Purge Plus identifies and eliminates 
duplicate records in a single file or across multiple files using 
a sophisticated matching algorithm. It can also recognize duplicate but 



misspelled names. M/P Pl^^^an also eliminate unwanted re^vls . The user 
gains complete control ovff: how the matching algorithm worW, and can 
specify the algorithm as either loose, medium, tight, or equal for 
individual name matching. Users can also either include or exclude the 
first name as a requirement for duplicate detection, and can include 
apartment numbers as an extension of the street address, so that records 
are identified as duplicates only if the apartment numbers match . 
Merge/Purge Plus generates a full set of output files, including a mailing 
file fully cleaned of duplicates and purges, a duplicate names file, and an 
additional copy of the mailing file. 

DESCRIPTORS: Data Quality; Direct Marketing; File Conversion; List 
Processing; Mailing Lists; Marketing Information 

HARDWARE: IBM AS/400; IBM iSeries; IBM Mainframe; IBM PC & Compatibles; 
UNIX 

OPERATING SYSTEM: Linux; MVS; OS/400; UNIX; VSE; Windows NT/2000; Windows 
XP 

PROGRAM LANGUAGES: Not Available 

TYPE OF PRODUCT: Mainframe; Mini; Micro; Workstation 

POTENTIAL USERS: Marketing, Cross Industry, Direct Mail in U.S. and Canada 
PRICE: Available upon request 

REVISION DATE: 20031218 
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RECORD TYPE: Review 
REVIEW TYPE: Review 
GRADE: A 

Peoplesmith Software's Double Take 4.1, a database cleaning and 
conditioning utility, gets very good marks overall, especially for the 
ability to process many files, file types, and sizes; several available 
problem-solving methods; versatile output choices; excellent processing 
performance rates; and Open Database Connectivity (ODBC) compliance. 
However, learning Double Take 4.1 takes some time, and the software does 
not yet support SQL Server 7.1. With Double Take 4.1, database 
administrators can streamline four pervasive data problems in companies 
that need squeaky clean database stability without getting into data 
warehousing. Testers could eliminate duplicate records from single and 
multiple file sets; merge multiple files into one output file; or 
combine the two tasks into one operation called merge /purge, which 
cleaned 2 million records per hour. Double Take 4.1 uses a matchcode , 
or a collection of name and address components, that works effectively to 
represent the one-of-a-kind identify of a complete record. Keys are created 
and then compared . during the process that eliminates duplicate records 

Double Take's best advantage is its ability to find near-duplicates 
with misspellings, typos, nicknames, and different or multiple addresses. 
The Matchcode Editor permits refining of keys that pinpoint these records. 

PRICE: $3000 



COMPANY NAME: Peoplesmith Software (455458) 



Lties; File Conversion; IBM PC ^^C 



SPECIAL FEATURE : Charts^ 

DESCRIPTORS: Database UcTIities; File Conversion; IBM PC ^^Compatibles ; 

List Processing; Mailing Lists; Quality Assurance; Windows; Windows 

NT/2000 
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PRODUCT NAMES: MetaConnect for Database Applications (695556) ; 
Meta-Directory 1.0 (741728) 

TITLE: Netscape Teams with Isocor to Deliver Meta-Directory 

AUTHOR: Sullivan, Thomas 

SOURCE: ent, v4 n4 pl8(2) Feb 17, 1999 

ISSN: 1085-2395 

HOMEPAGE: http : //www. entmag. com 

RECORD TYPE: Review 

REVIEW TYPE: Product Analysis 

GRADE: Product Analysis, No Rating 

Isocor' s MetaConnect for Database Applications and Netscape Communications' 
Meta-Directory 1.0 will both be part of a new metadirectory technology to 
be jointly developed by the two vendors. Isocor is an Internet messaging 
and directory software developer. MetaConnect provides a join engine that 

identifies duplicate directory records in various directories, and 
automatically merges the information in one directory entry. The 
metadirectory platform will also use Isocor 's connector technology to 
provide integration with information sources, which can include operating 
systems (OSs), telephony products, and database applications. Netscape is a 
licensee of Isocor' s join technology, which is the basis of MetaConnect and 
Netscape's Meta-Directory 1.0 suite, which includes Netscape Directory 
Server, the metadirectory server, and some connectors. A central repository 
is required to build new Internet applications that allow transparent 
access to all of a company's data, says a senior product manager for 
Meta-Directory at Netscape. The suite will provide e-commerce applications 
with different directory information from human resource systems, 
databases, messaging systems, and network operating systems (NOSs) . 
Important functions of the suite include the ability to move data from 
multiple directories into the Meta-Directory, to keep data current, and to 
join data using namespaces. 

COMPANY NAME: Critical Path Inc (670413); Netscape Communications Corp 
(592625) 

SPECIAL FEATURE: Screen Layouts 

DESCRIPTORS: Computer Resource Management; Network Administration; Network 

Software 
REVISION DATE: 20020630 
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PRODUCT NAME: PERSIVAL (149195) 

Columbia University (734403) 

722 W 168th St #1030 

New York, NY 10032 United States 

TELEPHONE: (212) 305-3927 

RECORD TYPE: Directory 

CONTACT: Sales Department 

The PErsonalized Retrieval and Summarization of Image, Video And Language 
(PERSIVAL) system, offered by Columbia University's NLP Group, provides 
clinicians and patients with access to a wide range of online health 
care information. PERSIVAL personalizes searches, summaries, and 
presentation elements for end users. It references patient records that 
are available at Columbia Presbyterian Medical Center (CPMC) . It also 
allows users who are not CPMC patients to create new patient records . 
Referencing records, PERSIVAL automatically targets search results. The 
system identifies and summarizes multimedia content distributed across 
multiple local and online resources. It flags conflicting information and 
eliminates duplicate results. The system also identifies source types. 
Multimedia sources range from diagnostic test results to educational 
videos. PERSIVAL automatically summarizes video content. 

DESCRIPTORS: Consumer Health Care; Health Care; Indexing; Information 
Retrieval; Medical Diagnosis; Patient Care; Personalization; Search 
Engines 

HARDWARE: Hardware Independent 
OPERATING SYSTEM: Open Systems 
PROGRAM LANGUAGES: Not Available 
TYPE OF PRODUCT: Micro 

POTENTIAL USERS: Personalized Medical Information Retrieval 
PRICE: Available upon request 
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PRODUCT NAME: My Labels 6.1 Standard & Premium (169978) 

BVRP Software USA (643963) 
1333 W 120th Ave #314 

Westminster, CO 80234 United States 
TELEPHONE: (720) 330-1400 

RECORD TYPE: Directory 

CONTACT: Sales Department 

BVRP Software USA's My Labels, offered in Standard and Premium 6.1 
editions, streamlines the creation and printing of labels. The system 
includes 10,000 clip art images. It also can import logos or other images. 
My Labels works with Avery (R) , NEBS (R) , PaperDirect (R) , and other 
labels. Users can select from over 400 standard label sizes. Users also can 
define customized labels. The product imports information from a wide range 
of databases. It also can add POSTNET barcodes to labels. My Labels 
includes shrink-to-fit formatting and color adjustment features. It can 
print UPC-A, UPC-E, Code 39, Code 128, and other product barcodes. My 
Labels Premium 6.1 includes text alignment and rotation features. It 
includes 2.5 million clip art and photographic images. The program also 
allows users to remove duplicate records from imported data . Labels 
can be previewed before they are printed. 

DESCRIPTORS: Barcoding; Database Utilities; Labels; Print Utilities 
HARDWARE: IBM PC & Compatibles; Pentium 

OPERATING SYSTEM: Windows; Windows NT/2000; Windows XP 
PROGRAM LANGUAGES: Not Available 
TYPE OF PRODUCT: Micro 

POTENTIAL USERS: Cross Industry, Offices, Small Businesses 
PRICE: $19.95 

TRAINING AVAILABLE: Technical support 

OTHER REQUIREMENTS: 32MB RAM; Win 9x+ required 

REVISION DATE: 20030921 
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PRODUCT NAME: Copernic Agent 6 Personal, Professional (134546) 

Copernic Technologies Inc (640913) 
360 Franquet #60 
Ste-Foy, PQ GIN 4N3 Canada 
TELEPHONE: (418) 527-0528 

RECORD TYPE: Directory 

CONTACT: Sales Department 

Copernic Agent 6 Personal, Professional, and Basic from Copernic 
Technologies is a Web search engine that allows users to target Internet 
searches, organize results, access hidden data, create customized search 



'categories, and automatical. y remove results with broken As. Copernic 
Agent's parallel search features allow users to employ ove^^l,000 
specialized search engines. The engines are organized into 120 categories. 
The program removes duplicate records from search results. Copernic 
Agent's management tools let users filter, group, and sort results; save 
pages; search within results; detect duplicate pages; view detailed search 
histories; refine search parameters; create custom categories; and export 
results in HTML, CSV, and other formats. The system integrates with 
Microsoft (TM) Internet Explorer and Microsoft Office. Users employ a 
toolbar to access search engines from the Microsoft the programs. Copernic 
Agent includes context-sensitive help features. The Personal Edition adds 
features such as region filters, result-grouping tools, and Microsoft 
Office integration. The Professional Edition adds search updaing, Web page 
analysis, page-language filters, date-modified filters, and much more. 

DESCRIPTORS: Alerts; Front Ends; Internet Content Filters; Search Engines; 
Webmasters 

HARDWARE: IBM PC & Compatibles 

OPERATING SYSTEM: Internet Explorer; Open Systems; Windows NT/2000 
PROGRAM LANGUAGES: Not Available 
TYPE OF PRODUCT: Micro 

POTENTIAL USERS: Cross Industry, Site Owners 
PRICE: Available upon request 

DOCUMENTATION AVAILABLE: Online documentation 
REVISION DATE: 20030717 
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PRODUCT NAME: Confort (121703) 

Tripos Inc (481491) 
1699 S Hanley Rd 

St Louis, MO 63144-2913 United States 
TELEPHONE: (314) 647-1099 

RECORD TYPE: Directory 

CONTACT: Sales Department 

Tripos's Confort (TM) is a conformational analysis tool that examines 
drug-sized molecules. The product generates diverse conformers for 
receptor-ligand docking. It can be applied to entire structures or to 
user-defined substructures . Confort can search rings and acyclic rotatable 
bonds. The system removes duplicate information . Referencing 
user-defined number, energy, and diversity parameters, it outputs 
conformers. Confort can perform 3D database searches starting from multiple 
conformations. It can identify global minimum and low energy drug candidate 
conformations. The product handles the SMILES, UNITY hitlist/SLN, mol2, and 
MDL SDfile formats. Confort works with Tripos's UNITY (R) 3D search, 
DISCOtech (TM) pharmacophore hypothesis generation, QSAR with CoMFA (R) 
multi- conformer 3D QSAR analysis, and FlexS (R) shape-based screening 
programs . 

DESCRIPTORS: Bioinf ormatics ; Molecular Modeling; Pharmaceuticals; Research 
& Development 

HARDWARE: HP; IBM PC & Compatibles; Silicon Graphics; Sun; UNIX 
OPERATING SYSTEM: HP-UX; IRIX; Oracle; Solaris; Windows; Windows NT/2000 
PROGRAM LANGUAGES: Not Available 
TYPE OF PRODUCT: Mini; Micro; Workstation 



POTENTIAL USERS: Pharma^^Licals , Conformational Analysis 
PRICE: Available upon request 

REVISION DATE: 030206 
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PRODUCT NAMES: DB Assistant 2.0 Beta (775576) 

TITLE: DB Assistant provides an ambulatory path through your data jungle 

AUTHOR: Hollander, Geoffrey 

SOURCE : InfoWorlcl, v21 n42 p68(l) Oct 18, 1999 
ISSN: 0199-6649 

HOMEPAGE: http : / /www . inf oworld . com 

RECORD TYPE: Review 
REVIEW TYPE: Review 
GRADE : A 

Bloodhound Software's DB Assistant 2.0 beta, a data transformation software 
package, adroitly cleans data for use in data warehouses data marts, or 
other decision support applications. Because clean, standardized data is 
required for such resources, DB Assistant's speed and efficiency, which 
result in much lower cleaning time than other products, are a significant 
advantage. Users can combine or nest tasks and reuse procedures. Processing 
speed is good and duplicate records can be weeded out. However, the merge 
module lacks grouping and cross-record summarization features, and the 
purge module cannot shift child records. DB Assistant 2.0 is more flexible, 
robust, and economical than competing products, including Smart DB 
Workbench and is more full-functioned than Data Junction or DBMS. Although 
DB Assistant 2.0 would be of great use to users of operating systems other 
than Windows, only Windows is supported. This release supports Object 
Linking and Embedding (OLE) for Databases, which in turn supports links to 
structured and unstructured data sources. DB Assistant 2.0 also works with 
many data, including text files, databases, spreadsheets, and legacy data. 
Matchcodes Administrator is used to remove duplicate records , by 
creating components that best represent the unique identity of each record, 
then comparing the results. 
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SECOND OR PARALLEL? OR MULTIPLE? 

52 18205376 RECORD OR RECORDS OR DATA OR FILE OR FILES OR INFORMATION 

OR CITATION?. 

53 18227504 RECORD OR RECORDS OR DATA OR FILE OR FILES OR INFORMATION 

OR CITATION? 

54 14795806 REMOV? OR DELETE? OR DELETING? OR ERASE? ? OR ERASING? OR - . 
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OMPAR???? OR MATCH? OR PREVENT? 
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Streamline the Registration Process with EMPI . (Technology Information) 

Roget, Melisa 

Health Management Technology, 20, 5, 34 
June, 1999 

ISSN: 1074-4770 LANGUAGE: English RECORD TYPE : Fulltext 

WORD COUNT: 167 6 LINE COUNT: 0014 6 

possible duplicate records were reviewed between January and 
September. More than 35,000 of these records were identified as 
duplicates and were merged - reducing the EMerge database to more than 
1.6 million records. As the project grew to include multiple 
implementations, UAB decided additional resources were needed. The 
healthcare consulting firm, Insource Management Group, was enlisted to 
provide project management. 
Phase 2 : As of . . . 
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Streamline the Registration Process with EMPI . (University of Alabama at 

Birmingham Health System 1 s use of HIE ' s enterprise -wide master person 

index system, EMerge) (Company Operations) 

Roget, Melisa 

Health Management Technology, 20, 5, 34 
June, 1999 

ISSN: 1074-4770 LANGUAGE: English RECORD TYPE: Fulltext 

WORD COUNT: 167 6 LINE COUNT: 0014 6 

possible duplicate records were reviewed between January and 
September. More than 35,000 of these records were identified as 
duplicates and were merged - reducing the EMerge database to more than 
1.6 million records. As the project grew to include multiple 
implementations, UAB decided additional resources were needed. The 
healthcare consulting firm, Insource Management Group, was enlisted to 
provide project management. 
Phase 2 : As of . . . 
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How to find who. (master person indexes in health care systems integration) 
(includes a related articles on Matrix Rehabilitation's use of 
Datamedic's PMstation practice management solution and on TMC 
Healthcare's use of 3M's Healthcare Enterprise Management System) 
(Technology Information) 
Cupito, Mary Carmen 

Health Management Technology, vl9, n6, p32(5) 
May, 1998 

ISSN: 1074-4770 LANGUAGE: English RECORD TYPE: Fulltext; Abstract 

WORD COUNT: 3717 LINE COUNT: 00306 

... expected to do a lot: receive patient or member information from 

many systems, assign unique identifiers to each individual, merge 
records pertaining to the same person, update its database, remove 



'duplicate records , cc^^late a patient ' s records thr^^hout the 

enterprise, keep the source system's identifier or pointers to it... 
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Toney, Stephen R. 
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... 2] June E. Slach, "Detection and Elimination of Duplicates from 

Multidatabase Searches," Bulletin of the Medical Library Association 73, 
no. 3:235-37 (July 1985). [3]C. A. Giles and others, A Computerized Scheme 
for Duplicate Checking of Bibliographic Data Bases (NTIS CRNL/CSD-5, 
Aug. 1976). [4]C. A. Giles and others, An Experiment... 

. . .CONF-76-1011-2, n.d.). [5]Thomas B. Hickey and David J. Rypka, 
"Automatic Detection of Duplicate Monographic Records , " Journal of 
Library Automation 12, no. 2:125-42 (June 1979). [6] James L. Dolby... 

... Retrieval, " Journal of Library Automation 1, no. 4: 250-60 (Dec. 1968). 
[8]Pankaj Goyal, " Duplicate Record Identification in Bibliographic 
Databases," Information Systems 12 , no . 3 : 239-42 (1987). [9]E. J. 
Yannakoudakis . . . 
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(c) 2004 The Dialog Corp. All rts. reserv. 
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The American Association for Artificial Intelligence (AAAI) Awards the Best 
In Practical Innovations of Intelligent Systems Applications 

PR NEWSWIRE 
July 27, 2000 

JOURNAL CODE: WPRW LANGUAGE: English RECORD TYPE: FULLTEXT 
WORD COUNT: 677 

(USE FORMAT 7 OR 9 FOR FULLTEXT) 

... integrated illustration of the patients' medical condition (s ) , 

course of treatment and disposition while intelligently deleting 

redundant information , combining complementary information to 

create the integrated picture, and assessing the medical risks and 
service. LifeCode also has a "self-awareness" feature that recognizes its 
own limits . . . 
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Netscape and ISOCOR Team to Deliver New Me ta -Directory Technology; Netscape 
Announces New Netscape (R) Meta-Directory 1 . 0 

DATE: January 18, 1999 08:29 EST WORD COUNT: 758 

. . . join" engine that will be a key part of the combined meta-directory 
platform. The join engine identifies duplicate directory records 
in multiple directories and automatically combines the information 
into a single, more complete directory entry. The meta-directory platform 
will also . . . 

...technology to provide integration with a variety of information sources, 
including operating systems, telephony products, database applications, 
and others . 

"With business to business electronic commerce expected to grow to more 
than. . . 
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DIALOG (R) File 16: Gale Group PROMT (R) 
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06483716 Supplier Number: 55143940 (USE FORMAT 7 FOR FULLTEXT) 
Low-cost Double Take keeps data honest. ( People Smi th ' s Double Take 4.1 
backup software) (Software Review) (Evaluation) 

Hollander, Geoffrey 
InfoWorld, v21, n28, p56 
July 12, 1999 

Language: English Record Type: Fulltext 
Article Type: Evaluation 
Document Type: Magazine/ Journal ; Trade 
Word Count: 613 

PeopleSmith 1 s Double Take 4.1 provides an effective solution for 
database administrators to simplify four persistent data problems in 
companies that require immaculate database reliability without taking the 
plunge into high-end warehousing. During my tests, I was able to purge 
duplicate records from single and multiple file sets, merge 
several files into a single consolidated output file, or combine the two 
into a single operation known as a "merge/purge" at an impressive rate of 2 
million records per hour. 

Double Take uses a matchcode , or collection of name and address 
components that best represent the unique identity of a... 
...BOTTOM LINE: VERY GOOD 
Double Take 4.1 

Summary: For database administrators who need to find and remove 
troublesome duplicate records or merge unique records from 
disparate databases , Double Take provides the most comprehensive, 
efficient, and reasonably priced answer - especially if you're not ready 
for a full-blown database warehousing solution. 

Business Case: Duplicate records are costly and wreak havoc on every 
level from. . . 
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Netscape, Isocor Team On^ba Directory Technology 01/21/! 

Newsbytes PM, pNA 
Jan 21, 1999 

Language: English Record Type: Fulltext 
Document Type: Newswire; Trade 
Word Count: 212 



(USE FORMAT 7 FOR FULLTEXT) 
TEXT: 

...the companies say will be a "key part" of the combined meta- directory 
platform. The join engine reportedly identifies duplicate directory 
records in multiple directories and "automatically combines the 
information into a single, more complete directory entry." ISOCOR' s 
connector technology will also... 

...directory platform to provide "integration" with such information 
sources as operating systems, telephony products and database 
applications. Reported by Newsbytes News Network, http://www.newsbytes.com 
. (19990121/Press Contact: ISOCOR Ltd... 
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04358954 Supplier Number: 46392902 (USE FORMAT 7 FOR FULLTEXT) 
New Software Product: "Data Clustering Engine" 

News Release, pN/A 
May 17, 1996 

Language: English Record Type: Fulltext 
Document Type: Magazine/ Journal ; Trade 
Word Count: 293 

(USE FORMAT 7 FOR FULLTEXT) 
TEXT: 

. . .The product can be useful in many applications for large organizations 
that have multiple, varied databases . It can be used to help eliminate 
duplicate records or identify various relationships between records. 
It can also be used for " data mining " purposes for customer 
information, marketing or fraud investigation systems. The product can also 
be used as a tool to assist in combining multiple files into one " data 

warehouse " , both to set up the warehouse for the first time and also as 
an on. . . 

...any other information that the user chooses. The product uses it's own 
high performance database which can access data from any of the popular 
databases - The product has its own Application Programming Interface (API) 
which application programs can use to . . . 
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Access Database Manager Comes Of Age 
Open Systems Today, p85 
April 25, 1994 

Language: English Record Type: Fulltext 
Document Type: Newsletter; Trade 
Word Count: 2433 

... query's data will come. These tables can be attached to either 

local or remote databases , and one query can combine data from 
multiple databases . You then define or remove any default 
relationships between these tables, as needed. Then select each field you 
want to. . . 
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02637258 Supplier Number: 43510891 (USE FORMAT 7 FOR FULLTEXT) 
Software Converts Legacy To Other Formats 12/07/92 

Newsbytes, pN/A 
Dec 7, 1992 

Language: English Record Type: Fulltext 
Document Type: Newswire; General Trade 
Word Count: 659 

... minutes, the executive VP said. Without performing any programming, 

the user can specify conditional queries, merge data from multiple 
sources, and check data integrity, operations previously possible only 
through handwritten programs. The Data Conversion Tool also produces 
detailed documentation of the conversion and the inter- database 
relationships. 

The Extract Libraries, a product still in the pilot phase, consists of 
packaged definitions. . . 
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04402086 Supplier Number: 46457689 (USE FORMAT 7 FOR FULLTEXT) 
Apertus Announces New Relationship with Postalsoft, Extends Relationship 

with Object Design Flagship Product Enterprise/ Integrator Drives Addition 

of "Best-of -Breed" Partners. 

Business Wire, p06111304 
June 11, 1996 

Language: English Record Type: Fulltext 
Document Type: Newswire; Trade 
Word Count: 548 

. . . oriented development tool for creating applications that transform, 

cleanse, merge, and synchronize data from heterogeneous databases . More 
than a simple data transformation tool, Enterprise/Integrator allows users 
to graphically develop solutions for complex data quality problems, such as 

identifying and consolidating logically redundant data and 
resolving value conflicts. The product is available on Sun SPARC, HP 9000 
Series , and . . . 
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Measuring the performance of a merging algorithm: mismatches, 

missed-matches , and overlap in Israel's Union List. (Union List of 

Monographs ) 

Meir, Daniel D.; Laszinger, Susan S. 

Information Technology and Libraries, vl7, n3, pll6(8) 
Sept, 1998 

ISSN: 0730-9295 LANGUAGE: English RECORD TYPE: Fulltext; Abstract 

WORD COUNT: 4207 LINE COUNT: 00395 

... Merge and Not to Merge," 216. 

(5.) Martha E. Williams and Keith D. MacLaury, "Automatic Merging of 
Monographic Data Bases — Identification of Duplicate Records in 
Multiple Files : The IUCS Scheme," Journal of Library Automation 12 (June 
1979) : 166. 

(6.) Stephen R. Toney. . . 
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New Ovid Release Enhances Searching. 

EContent, 23, 3, 8 
June, 2000 

ISSN: 1525-2531 LANGUAGE : English 

WORD COUNT: 8 8 LINE COUNT: 00011 
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TEXT: 

...and Deduping, which lets users combine multiple Ovid data bases, 
search them simultaneously, and automatically remove duplicate records 
from search results. (800/950-2 035; 212/563-3006; http://www.ovid.com) 
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Cumulative Contents of the Journal of Library Automation, 
1968-1981. (Directory) 

Information Technology and Libraries, 17, 4, 233(1) 
Dec, 1998 

DOCUMENT TYPE: Directory ISSN: 0730-9295 LANGUAGE: English 

RECORD TYPE: Fulltext 

WORD COUNT: 5731 LINE COUNT: 00499 

of Fixed-Length Keys Derived from Title Strings" (p. 143-55), Keith 
D. MacLaury 

"Automatic Merging of Monographic Data Bases — Identification 
of Duplicate Records in Multiple Files : The IUCS Scheme" 
(p. 156-68), Martha E. Williams and Keith D. MacLaury 

Communications: "Maximum. . . 
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Technology Closeup: Main Entry Point For Contacts -- Support for vCard is 

key. (contact manager software that support the vCard standard) 

(Technology Information) 

Correia, Edward J . 

Computer Reseller News, n805, p99(l) 
August 31, 1998 

ISSN: 0893-8377 LANGUAGE: English RECORD TYPE: Fulltext 

WORD COUNT: 322 LINE COUNT: 00027 

. . . the others can duplicate. 

The next best supporter of the standard was ACT, which can merge 
vCards into database records with dupe checking • Neither the 
Organizer 5.0 beta nor GoldMine 4.0 features vCard duplicate checking; both 
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Apertus Announces Development Tool for Enterprise -wide Data Integration 



Enterprise/ Integrator in Building the Data Warehouj 

Business Wire, p8141136 
August 14, 1995 

LANGUAGE: English RECORD TYPE: Full text 

WORD COUNT: 4 26 LINE COUNT: 00053 

TEXT: 

...transformation tool, it allows the development of solutions for 
complex data integration issues that include identifying and 
consolidating logically redundant data such as names and addresses. It 
also resolves value conflicts such as identifying a correct address from 
multiple , inconsistent records . (See August 8 Dow Jones article, " Data 
Warehouses Seen As Big Opportunity for Tech Cos.")... 
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To merge and not to merge: Israel's Union List of Monographs in the context 
of merging algorithms. 

Lazinger, Susan S. 

Information Technology and Libraries, v!3, n3, p213(7) 
Sept, 1994 

ISSN: 0730-9295 LANGUAGE: ENGLISH RECORD TYPE: FULLTEXT; ABSTRACT 

WORD COUNT: 5819 LINE COUNT: 00472 

cleanup and deduplication" algorithm described by Toney, the first 
pass inspected the entire database and grouped the records into 
"pools." The second pass examined each pool with a more precise algorithm 
to find duplicates. The advantage to ... together records derived from a 
standard LC cataloging in a minimum of processing time. A second phase 
processed records that didn 1 t match on the minimum set of data elements 
through the remainder of the algorithm, comparing the... 

...of Illinois algorithm consisted of a first step that generated a key to 
partition the database into many small sets of possible duplicates based 
on the last two digits of the. . . 

. . .with its very tight matching criteria, state that "the . . . program 
committed very few errors in matching different records but missed many 

duplicates ."(26) Thus, in one sentence they pinpoint the major problem 
which must be faced in... of Library Automation 12 (June 1979): 131; Martha 
E. Williams and Keith D. MacLaury, "Automatic Merging of Monographic 
Data Bases -- Identification of Duplicate Records in Multiple 
Files : The IUCS Scheme," Journal of Library Automation 12 (June 1979): 
159-60. 

(17 . ) Coyle, "Record. . . 
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Cleanup and deduplication of an international bibliographic database. 

Toney, Stephen R. 

Information Technology and Libraries, vll, nl, pl9(10) 
March, 1992 
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... s own. [Tabular Data Omitted] 
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TEXT: 

. . .begins to search multiple databases with overlapping sources, the 
need for a way to sort merged output, identify the duplicates , and 
retain the records one chooses becomes imperative. Duplicate detection 
is a longawaited, major system enhancement which allows searchers to do all 
of these things. 
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RECORD TYPE: Fulltext 
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WORD COUNT: 302 

... the others can duplicate. 

The next best supporter of the standard was ACT, which can merge 
vCards into database records with dupe checking . Neither the 
Organizer 5.0 beta nor GoldMine 4.0 features vCard duplicate checking; 
both. . . 

9 



• 9 

Set Items Description 

51 7040004 DUPLICAT? OR DUPE? ? OR COMMON? OR SAME? OR REDUNDAN? OR R- 

EPLICA? OR REPEAT? OR IDENTICAL? OR DOUBLE? OR ANALOGOUS? OR - 
SECOND OR PARALLEL? OR MULTIPLE? 

52 10979839 RECORD OR RECORDS OR DATA OR FILE OR FILES OR INFORMATION 

OR CITATION?. 

53 10990032 RECORD OR RECORDS OR DATA OR FILE OR FILES OR INFORMATION 

OR CITATION? 

54 6119049 REMOV? OR DELETE? OR DELETING? OR ERASE? ? OR ERASING? OR - 

DEDUPE? ? OR CHECK? OR FIND???? OR IDENTIF? OR DETECT? OR C- 
OMPAR???? OR MATCH? OR PREVENT? 

55 330820 S1(3N)S3 

56 11940 S5(5N)S4 

57 9102853 MERGE? OR MERGING? OR CONSOLIDAT? OR COMBIN? OR JOIN???' OR 

GROUP? OR COMPRESS? OR COMPOSITE? 



S8 


16089 


S7 (5N) S5 






S9 


882 


S6 AND S8 






S10 


2755530 


MEDICAL? OR PATIENT? 


OR HEALTH? 


OR HOSPITAL? OR CLINICAL? 


Sll 


177 


S10 AND S9 






S12 


2755530 


MEDICAL? OR PATIENT? 


OR HEALTH? 


OR HOSPITAL? OR CLINICAL? 


S13 


177 


Sll AND S12 






S14 


176 


Sll (S) S12 






S15 


159 


RD (unique items) 






S16 


131 


S15 NOT PY>2001 






S17 


130 


S16 NOT PD>20010905 






S18 


44 


S17(S) (DATABASE? OR 


DATA ( ) BASE? 


OR DATA ( 3N) (WAREHOUS? OR 



WARE()HOUS? OR MINE? ? OR MINING?) OR DATAMIN? OR DB OR DBS OR 
RECORD? ( ) MANAGE? ) 



S19 


2 


S17 (S) (EMR OR ELECTRONIC () MEDICAL () RECORD? 


S20 


3 


S17 AND (EMR OR ELECTRONIC () MEDICAL () RECORD? 


S21 


3 


S20 AND S6 


S22 


43 


S18 NOT S21 


S23 


108 


S9 (10N) (DATE? OR TIME? OR CONTENT?) 


S24 


102 


RD (unique items) 


S25 


85 


S24 NOT PY>2001 


S26 


1 


DUPE () CHECK? AND (ACT OR ARES) 



?show files 

File 15:ABl/lnform(R) 1971-2004/Mar 30 

(c) 2004 ProQuest Inf o&Learning 
File 9:Business & Industry(R) Jul/1994-2004/Mar 30 

(c) 2004 The Gale Group 
File 610: Business Wire 1999-2004/Mar 31 

(c) 2004 Business Wire. 
File 810: Business Wire 1986-1999/Feb 28 

(c) 1999 Business Wire 
File 275:Gale Group Computer DB(TM) 1983-2004/Mar 31 

(c) 2004 The Gale Group 
File 476: Financial Times Fulltext 1982-2004/Mar 31 

(c) 2004 Financial Times Ltd 
File 624 :McGraw-Hill Publications 1985-2004/Mar 30 

(c) 2004 McGraw-Hill Co. Inc 
File 621:Gale Group New Prod . Annou . ( R) 1985-2 004/Mar 31 

(c) 2004 The Gale Group 
File 636:Gale Group Newsletter DB(TM) 1987-2004/Mar 31 

(c) 2004 The Gale Group 
File 613: PR Newswire 1999-2004/Mar 31 

(c) 2004 PR Newswire Association Inc 
File 618:Xinhua News 1999-2004/Mar 30 

(c) 2004 Xinhua News via Comtex 



?t s21/3,k/l-2 



21/3, K/l (Item 1 from file: 15) 

DIALOG (R) File 15 :ABI/ Inform (R) 

(c) 2004 ProQuest Inf o&Learning . All rts . reserv. 
02521370 116356554 

Issues in the multi -disciplinary assessment of healthcare information 
systems 

Heathfield, Heather; Hudson, Peter; Kay, Stephen; Mackay, Lesley; Marley, 
Tom; Nicholson, Lorraine; Peel, Victor; Roberts, Ruth; Williams, John 
Information Technology & People vl2n3 PP: 253-275 1999 
ISSN: 0959-3845 JRNL CODE: OTP 
WORD COUNT: 9650 

Issues in the multi -disciplinary assessment of healthcare information 
systems 

...ABSTRACT: considers the problems of a multi-disciplinary team working 
together to understand and evaluate a healthcare information system, 
which itself is situated in a complex organisational and political 
environment. A general... 

. . . faced by evaluators of such systems is provided. A specific evaluation 
project is described (Electronic Patient Records in the UK National 
Health Service) , and an account of the evaluation process as it occurred 
is given. Some of... 

TEXT: Heather Heathfield : Computing Department, Manchester Metropolitan 
University, Manchester, UK 

Peter Hudson: Health Services Management Unit, University of Manchester, 
Manchester, UK 

Stephen Kay: Department of Computing, University of . . . 

Tom Marley: Department of Computing, University of Manchester, 
Manchester, UK 

Lorraine Nicholson and Victor Peel: Health Services Management Unit, 
University of Manchester, Manchester, UK 

Ruth Roberts and John Williams: School of Postgraduate Studies in Medical 
and Health Care, Swansea, UK 

Introduction 

This paper examines some of the important, and often difficult issues... 
...source of data it provides. 

The subject of this evaluation project is that of Electronic Patient 
Records (EPRs) which are a relatively new and unexplored application 
domain. The first section of... 

. . .by the team, about the process of working as a multi-disciplinary team. 
Background 

Electronic patient record systems 

The use of information technology in healthcare is not new. The first 
digital computers were developed in the 1940s, and by the 1960s Hospital 
Information Systems (HIS) began to emerge. HIS are intended primarily as a 
means of communication, providing order-entry and results-reporting 
functionality. Whilst they may store some basic patient details, they do 
not contain detailed clinical information and are not intended to 
directly support clinical care. HIS are routinely used in many hospitals 
in both the UK and USA, as well as developing countries. 



Given 



the 



apparent 




ccess of HIS, attention tu 




:d towards the 



computerisation of patient records. Paper-based records have many 
problems associated with their use, for example availability, illegibility 

. . . EPRs as a solution to these problems, and likely to lead to significant 
improvements in healthcare . Research into the design and development of 
EPRs began in the 1970s. Three important early systems were developed in 
the USA, namely COSTAR (Barnett et al . , 1979), the Regenstrief Medical 

Record System (Pryor etal., 1983) and The Problem Orientated Medical 

Record System (Weed, 1969) . 

Today, EPRs are an important research area in health informatics, as can 
be seen by the numerous EPR projects, conferences devoted to the topic. . . 

... In 1991 the USA Institute of Medicine published a committee report 
titled The Computer-based Patient Record: An Essential Technology for 
Patient Care which stated that computer-based patient record systems 
have a unique potential to improve patient care (Dick and Steen, 1991) . 
This report has been extremely influential in setting out the requirements 
of computer-based patient record systems. Work on EPRs is progressing in 
the USA (e.g. Carpenter, 1994; Churgin... 

...and Steen, 1995) and Europe (e.g. van der Lei et al . , 1993). 
Evaluating electronic patient records 

Evaluations of healthcare applications first received serious attention 
in the 1970s (e.g. de Dombal et al . , 1974... 

...and "how" of evaluation (e.g. Wyatt and Spiegelhalter , 1990). 

As with any new intervention, healthcare applications do not exist 
independently of the economic realities that shape their design and 
influence the potential for their successful use in routine settings. The 
majority of healthcare systems were conceived as research projects, 
funded by government or private grants. The ultimate transmission... 

. . . had expired. Thus, the ability to demonstrate economic benefits is seen 
as essential by many healthcare professionals. However, there are other 
recognised potential benefits of IT, such as improved quality of patient 
care, brought about, by example, through better access to more accurate 
information (Detmer and Steen, 1995) . 

Economic analyses can be difficult for large healthcare applications of 
IT, and there are severe problems in applying the results of such analyses 
. . . Van der Loo (1995) examined the type of study design employed in the 
evaluation of healthcare information systems (1974-1995) . This study set 
out a "gold standard" of evaluation which comprises an economic analysis 
coupled with a randomised controlled clinical trial (RCT) , and scored the 
evaluation studies examined against this standard. Of the 108 studies... 

. . . studies and there is a strong body of opinion which advocates the need 
for all clinical systems to undergo RCTs . This debate is still highly 
topical as illustrated by two recent papers in the British Medical 
Journal (Donaldson, 1996; Locke, 1996) . 

Donabedian's classic work of the 1960s, recognised that the quality of 
health services can be measured along three dimensions: structure (or 
resources), process and outcomes (Donabedian, 1966... 

. . . systems focused on process and evaluations of treatment systems focused 
on outcomes, systems to support clinical care were generally examined in 
terms of structure. 



The application of RCTs to EPRs is. 



. . . recipients from wh^B to randomise, introducing rM^niness into the 

implementation of whole^^r part of hospital systems is^fmpossible . Each 
institution will have many different characteristics which it would be 
impossible. . . 

... into, virtually, a "national programme". Even randomisation within a 
particular institution (for example, randomly allocating patients to a 
clinician who uses a particular EPR system and one who does not) will... 

... an objective, when in reality this is an intermediate benefit that may 
produce benefits in patient care or the organisational setting. The 
second point highlighted by this review was that most. . . 

. . . recently, there is a growing awareness amongst those involved in the 
development and implementation of clinical systems, that social and 
organisational issues are at least of equal importance as technical issues 



... promote understanding of why such problems occurred (Rotman et al . , 
1996) . 

Recently the Computer-based Patient Record Institute of the USA published 
a discussion document on system evaluation. It gave support... 
... comparisons difficult. To complicate matters the term EPR is often used 
interchangeably with Computer-based Patient Record and also Clinical 
Workstation . 

The determination of the evaluation questions. Given the plethora of 
possible questions how does... the early 1970s with a few isolated but 
important local initiatives which sought to establish hospital wide 

clinical information systems, e.g. at the London Hospital (Barber et 
al . , 1994) . In the 1970s there was an attempt to build a national standard 

hospital information system. However, following an evaluation and review, 
this project was terminated in 1980. In 1989 the NHS launched the Hospital 

Information Support Systems (HISS) Project. This funded several pilot 
sites in the development of electronic communication systems to provide 
ward order-entry and results-reporting for individual patients . In 
addition to results, these systems store some basic patient 

identification details and an administrative, computerised index to those 

patients . They do not generally contain detailed clinical information 
and are not intended to directly support clinical care. Their role is to 
assist in the administration of care, the reimbursement of costs... 

. . . the UK is currently estimated to be in the region of approximately 20 
district general hospitals , from a total of 350 which each have more than 
300 beds. 

In 1994 the. . . 

...through its Information Management Group (IMG) , initiated two programmes 
of work which sought to improve patient records using information 
technology. The Electronic Patient Record (EPR) project is a three year 
research and development programme to assist clinicians in acute hospitals 
to provide better care to patients through the use of electronic 
patient record systems (NHS Executive, 1994a) . The Clinical Workstation 
(ICWS) project is a two year project concerned with providing interface 
facilities to an. . . 

... The PIDs were structured along the lines of sub-projects, for example 
document archiving or clinical terms. 

Burton Hospitals NHS Trust and Wirral Hospital NHS Trust are the 
demonstrator sites for the EPR project. Both these sites had established... 

...additional funds to extend the functionality of these systems to provide 
more comprehensive and richer clinical data for routine clinical care. 
The demonstrator sites for the ICWS project are Winchester and Eastleigh 



Healthcare NHS TrusA Bethlem and Maudsley NHS Tru^^nd Horizon NHS 
Trust. 

Despite the fact that there has been almost 30 years of experience with 
hospital -based clinical information systems in the NHS, published 
studies of system evaluation are scarce. Those that do... 

... on both system costs and benefits. Given the high estimated level of 
investment in UK hospital systems (Pounds 220million per annum) the Audit 
Commission (1995a, 1995b) has stated that there have... 

. . . sites themselves, other sites involved or about to embark on EPR/ICWS 
development, manufacturers of clinical systems, the EPR and ICWS 
Programme Boards and the Research and Development Directorate (RDD) of . . . 

. . . setup after previous HIS evaluation work (Welsh Office, 1993, 1996) 
identified key evaluation questions concerning clinical , organisation, 
technical, financial and managerial issues. PROBE provides guidance for the 
NHS on evaluating large... 

... both technical and social factors, using quantitative and qualitative 
methods, is currently considered important in healthcare . The Centre for 

Health Informatics at the Health Services Management Group of the 
University of Manchester instigated the formation of a multi-disciplinary 
team which included clinicians, social scientists, computer scientists, 

health service management experts and economists. They provide a 
combination of experience of the clinical environment, healthcare 

organisation, medical informatics technologies, patient records and 
the NHS organisation. They also have experience of ...projects. The 
projects had already started preventing any straightforward before and 
after comparisons. From a clinical perspective the RDD were keen to 
demonstrate improvements in patient care, as documented by patient 
outcomes . The problem lies in the fact that such documentation is not 
currently available and. . . 

. . .underlying team principles: 

In this particular project we cannot distinguish (for evaluation 
purposes) between a Hospital Information System (HIS), EPR and ICWS. The 
individual EPR/ICWS sites draw on the data... 

. . . combinations of paper-based and electronic data which together form the 
concept of a "total patient record". Our evaluation must explore the use 
and consequences of these combinations . 

A successful evaluation. . . defined by the team, are listed below. They 
include a wide range of perspectives, including clinical , technical, 
social, cultural, organisational, and managerial, reflecting the different 
backgrounds and perspectives of the team. 

What is the impact of the technology on clinical management? This will 
be considered at three levels: individual patient care, management of 
services and resource mangement. 

- What is the impact on the roles, the. . . 
...at the implementation sites? 

- Can the costs and benefits of such developments/ technologies be valued? 

Patient record systems and technologies: how useful and usable are 
they? 

- What is the relationship between. . . 



. . . clinicians and the system) rather than as a technique to gain wider 
understanding of the hospital context. However, this overlap in methods 
between different groups initially caused some problems, as different... 



. . . study. It would be very difficult to separate the^effect of these 
individual technologies on clinical practice. Furthermore, the recipients 
of the evaluation results may not perceive such an arbitrary distinctionbut 
instead makes the concept of a "total patient record" central to the 
framework. This allows us to examine the relationship between paper and. . . 

. . . employed and locations in which the record is stored. Using the concept 
of a "total patient record" also enables us to highlight specific 
relationships which are more pertinent to an EPR. . . 

...listed above. A simplified example is given below. 

Take research question one of the list. Patient record systems and 
technologies: how useful and usable are they? One of the specific criteria 



... selected sites. Methods used include observations, structured and 
semi-structured interviews, questionnaires and examination of patient 
records. Many different personnel at the sites have been included in the 
study: nursing staff, medical staff, managers, technical staff, and 
sub-project leaders. A great deal of effort was needed... 

. . . setup the site visits at times that were convenient for both the team 
members and hospital staff, and in a manner that caused as little 
disruption as possible to the sites ... because they made our work easier. 

The RDD pressed for documentation which demonstrated improvements in 
patient care. This was inevitably difficult, if not impossible, because 
of the evaluation commencing well after. . . 

. . .date, the interface between HIS and EPR being extremely blurred, and the 
paucity of analysable clinical data which failed to provide a baseline 
for comparison . 

As in the scoping study, the... as a descriptive story to the sites, whereas 
the computer scientists attempted to abstract their findings into areas 
such as duplication of information or interface issues. The aim of 
these sessions was to check that the site agreed... of the project, a subset 
of team members produced a progress report for a major healthcare and IT 
conference. This took the form of a short paper, -which, the authors thought 
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An international collaboratory based on virtual patient records 

ABSTRACT: Virtual patient records will impact health care 
internationally the way the Internet has transformed global communications. 
Virtual patient records are virtual in the sense they provide a view of 
data possibly configured differently. . . 

... locations, but united together into a common format for viewing at the 
required time. Virtual patient records provide a user with virtual 
access to data possibly scattered around the world. The virtual patient 
record should form the basis for a collaboratory environment in which 
multiple physicians, and ultimately the patient , should be able to engage 
in an interactive electronic discussion. International implications and 
standardization issues are discussed in detail. TeleMed, a prototype based 
on a virtual patient record, is described, and its applications are 
discussed. TeleMed uses a media-rich graphical patient record to allow 
multiple physicians, possibly located remotely across a wide-area network, 
to consult on a patient record. The latest version of TeleMed supports 
real-time interactive collaborations between multiple users. ... 
. . .TEXT: Omitted) 

Efforts to build a new information infrastructure will enhance and extend 
the quality of health care worldwide. Virtual patient records will 
impact health care internationally the way the Internet and the 
World-Wide Web transformed global communications. At the same time that 
political and sociological conditions are causing a revolution in health 



care, advanced technolog^M promise to introduce changes more radical. 

Health care has lon^ lagged far behind other industries in adopting 
information technologies. More recently, pressures... 

. . . generated an intense interest in how the latest computing technologies 
can help. While computerized virtual patient records [1] could help in 
many ways, their most significant impact may come from how they change the 
way we think about international aspects of health care. In this article, 
we describe what we mean by a collaboratory based on virtual patient 

records and the international implications of this new way of managing 

health care data. 



Virtual patient records are virtual in the sense they provide a view of 
data possibly configured differently. . . 

. . . locations, but united together into a common format for viewing at the 
required time. Virtual patient records provide a user with virtual access 
to data possibly scattered around the world. This... 

. . . the way virtual memory creates larger memory spaces by accessing 
supplemental storage only when required. 

Health care providers and payers have long realized the benefits 
computer-based patient records could provide [2] . Many health care 
practitioners complain that where computer records do exist for their 

patients , the clinical data lies tangled in a morass of systems 
designed primarily for financial claims processing. In... 

. . . did the financial side of our culture computerize its records long ago 
and leave the clinical side undone? Some say it's because health care 
information has much more complexity than financial information. Some say 
it's because computers require highly structured data, and health care 
data often has inherent ambiguities. Others say economics have higher 
priority than quality of care. The heart of the matter likely rests near 
the critical role health care plays in human culture. 

Many cultural anthropologists define culture as a way of life... 
... culture, because it deals with survival. All cultures resist change; 
it's a natural and healthy tendency that helps prevent harmful 
innovations from destroying a culture. Yet, all cultures do change... 

. . . the more computers represent potentially harmful threats to our lives . 
Designs for the future of health information systems need to preserve as 
much traditional culture as possible while also providing a. . . 

...means incorporating familiar, and therefore natural, human dynamics such 
as those involved in physician and patient encounters. 

With the advent of cyberspace, we now live in a slightly different world. 
We. . . 



... open to people from all nations, knitting the individual lives of 
humanity ever closer together. Health care is trying to make the most of 
this new world, too. People in health care use real-time video 
teleconferencing in the practice of medicine and call it "telemedicine" 
[3] . Some major hospitals have even put their medical records on the 
Web [4, 5]. However, these early innovations do not take advantage of some 
of the more advanced technologies that could provide even better solutions 
for health care while at the same time incorporating the feeling of a 
traditional live encounter. 



Collaboratory. . . 



. . . the Web provides very important connectivity and access for the 
publishing of information, it supports electronic medical records in 
a very primitive way compared to other alternatives. The HyperText Transfer * 
Protocol (http) , for... 



. . . This provides very^M^r support for interactive con^M:ivity with the 
data. 



We contend the virtual patient record should form the basis for a 
collaboratory environment in which multiple physicians, and ultimately the 
patient , should be able to engage in an interactive electronic 
discussion. This requires the support of... 

. . . transaction processing necessary to support multiple users not only 
browsing, but also possibly editing a patient record simultaneously. As 
we will explain, the same architecture that enables such realtime 
consultations can... 

... a great enhancement to the typical video consult by providing, in 
real-time, the full medical record of the patient . Moreover, by 
including patients and allowing them not only to access their data, but 
also to post reports and queries, patients can become better educated and 
more empowered to manage their own health . 

Because of the high degree of complexity of health care data and the 
information intensive aspects of health care delivery, these 
collaboratory architectural features are essential for virtual patient 
records. Moreover, this same architecture that supports collaboratory 
consultations in a local region could also... We are collaborating with a 
number of organizations to help standardize the electronic management of 
patient information, using the most advanced information technologies, to 
form a seamless international continuum of quality health care. 

International Implications 

Ubiquitous adoption of nonproprietary standards provides the foundation 
that makes the Internet and the Web possible. By adopting similar 
international standards for virtual patient records, the day may soon 
come when electronic medical records securely cross international 

borders. In recent years advocates of computer-based patient records have 
set the goal of maintaining longitudinal records that will manage health 
data for a person's entire lifetime-prenatal to post-mortem. Health care 
quality depends on the continuity of care that longitudinal records could 
provide, but people often cross international borders during their lives, 
sometimes with the expressed purpose of seeking health care outside their 
own country. 

Also, highly intensive monitoring, collecting and mining of health data 
at an international level would better support public policy decisions. 
Additional information derived from. . . 

...we are collaborating with other organizations to help establish a global 
network of distributed virtual patient records for the surveillance of 
infectious diseases-the leading cause of death worldwide [6], Global 
health care surveillance networks could provide important data for 
international health policy decision-making as well as raising the 
quality of care provided by international relief organizations . 

International standards already exist for international electronic exchange 
of personal financial data. Personal health data could move across the 
networks just as easily based on international standards that provide. . . 
... S. enacted into law a process that will govern the adoption of national 
standards for health -related electronic commerce [7] . The law requires 
compliance with national standards by the year 2000. This new legislation 
requires a standard unique health identifier for each individual, 
employer, health plan, and health care provider for use in the health 
care system. The legislation also calls for other standards to be adopted 
by the U.S. Department of Health and Human Services, and it lays out a 
timetable for adopting those standards together with... 



. . . some may contend it will be a long time before we see international 
standards for health care. An additional approach to international 
standardization seeks to base compliance on the collaborative agreements... 



...applications happen t6^5e located in the world [8, 9]. 

The OMG officially established the health care domain task force 
(CORBAmed) to standardize object interfaces for health care applications 
at a global level. A number of other organizations also seek to standardize 

health care information and communication including: CEN (European 
Committee for Standardization) , UN/EDIFACT (United Nations rules for 
Electronic Data Interchange For Administration, Commerce and Transport) , 
JWG-CDM ( Joint Working Group for a Common Data Model), HL7 ( Health 

Level 7), ASTM (American Society for Testing and Materials), CPRI 
(Computerized Patient Record Institute), and others. Some of these 
standard developing organizations are currently working to develop... 

...or Request For Comment (RFC) process. 

One of the most basic standardization issues for international health 
care transactions involves how to handle the many different person 
identifiers existing in disparate medical record systems. Even within a 
single hospital , different departments may have separate electronic 
records for the same person, each with a different identification 
number for that person. Historically, health care providers dealt with 
this issue by creating a Master Patient Index (MPI) that used a limited 
set of demographic data (for example, name, gender, date of birth, and so 
on) to help retrieve the disparate elements of a patient 's records. 
Originally, MPIs were manual and were primarily used to link patient 
episodes for continuous care. Maintaining the continuity of patient care 
is still the most important role of automated MPIs. However, existing MPI 
software applications... 

... to organize a series of workshops aimed directly at this issue. 
Viability of a virtual patient record collaboratory will depend heavily 
on the ability to quickly and securely identify a patient and his or her 
respective health care providers and payers. Different indexing schemes 
used in different nations, as well as privacy MPI systems. The Health 
Level 7 standards developing organization has now started a special 
interest group focusing on MPI Mediator issues. 

A MPI can be summarized as a secured directory of uniquely identifiable 
patients together with pointers to locations where the respective 
medically related information about each patient resides. A MPI does 

not record any clinical data; it merely points to where clinical data 

is located, whether in electronic form or possibly on paper. By relying on 

proprietary. . . 

...grant access to pointers that provide authorized and secure access paths 
to records from disparate health care organizations, even when 
distributed at an international level. 

Pointers to sites of care have the potential for revealing confidential 
information about a patient ; a pointer could reveal a patient received 
care at an AIDS clinic, an abortion clinic, or in prison. Therefore, 
privacy advocates contend that standards for authorization and access 
controls must allow the patients to decide to whom they grant access and 
which records, if any, to make available... 

... 9] . The MPI Mediator approach has the potential to facilitate such 
authorizations for access to health records. 

After helping to organize the series of workshops seeking a solution to the 
MPI . . . 

. . . international level-we worked closely with CORBAmed to help generate a 
RFP requesting proposals for patient identification services. We are 
currently collaborating with various MPI vendors to encourage them to 
produce ... 




. . . the final specif ic^Bbn the OMG recommends could^^fcome the key to 

mediating international exchanges of health - related information. 



TeleMed: A Prototype Based on a Virtual Patient Record 

Working in collaboration with physicians at the National Jewish Medical 
and Research Center (NJC) in Denver, we developed TeleMed, an early 
prototype for a virtual patient record. TeleMed uses a media-rich 
graphical patient record [9] to allow multiple physicians, possibly 
located remotely across a wide-area network, to consult on a patient 
record. Consultations can take place interactively in realtime, or offline 
using textual or audio annotations combined with graphical markers in the 
record . 

The original prototype demonstrated the feasibility of virtual patient 
records, but it did not reach the stage of full functionality with actual 
patient records on a day-to-day basis. Los Alamos National Laboratory 

plans to deploy a more functional version of TeleMed in some of the small 

clinics and hospitals of northern New Mexico. The Northern New Mexico 

Rural Telemedicine . Project (NNMRTP) led by the... 

...for a collaboratory based on the TeleMed prototype. 

TeleMed dynamically assembles a chronologically oriented graphical patient 
record using data gathered from several different remote locations. It 
assigns icons and other graphical... 

... Object Request Brokers (ORBs) then mediate the interactive access 
linking the icons in the graphical patient record to the distributed 
databases that provide the persistent object storage for the multimedia 
data. . . 

... between multiple users. Multiple physicians at remote locations can 
simultaneously view, edit, and annotate the patient data. Furthermore, 
each physician can see the data another physician has entered as well as... 

. . . enables the support of full transaction processing necessary to support 
multiple users interacting with the patient record simultaneously. Such a 
capability enables the same architecture to be used for real-time... 

... different database. Once a database has been selected, a CORBA-based 
transaction vends the relevant patient record objects from the selected 
site to the requesting client. Textual data from the retrieved patient 
objects is then used to populate a list of patient names. 

When the user selects one of the names in the list, that patient 's 
Graphical Patient Record (GPR) window appears. TeleMed' s chronological 
GPR is dynamically assembled by CORBA-based transactions that merge all 
of the relevant data from multiple databases to form the seamless view 
presented to the user. The fact the user selected... 

... as a guide to help determine where to produce full transverse slice 
images of the patient . In this interface, the scout is now used as a 
guide to help the user... 

... particular interface. Suppose a general practice physician wanted to 
consult with a specialist about this patient by using TeleMed. And let's 
say the physician wants to discuss the particular slice... 

. . .display window depending on the type of annotation. 

Conclusion 

An international collaboratory based on virtual patient records can 
eventually include a wide assortment of health care applications all 
designed to allow health practitioners and their patients to work 
together more naturally and more effectively and, thereby, deliver a higher 
quality care . . . 



... to invest now durin^B time of transition to develop^Wiew information 
infrastructure for health care. 

More information about TeleMed can be found at 

http : //www. acl . lanl . gov/TeleMed 
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...TEXT: management skills (Roberts, 1999). 
Privacy Issues 

Employers have the obligation to protect employees 1 personal records, 
medical records, and the results of employment related tests (Myers, 
1988) . Major privacy concerns related to. . . 

. . . information can be stored in the system and who can access, change, or 
modify the database (Noe, Hollenbeck, Gerhart, and Wright, 1994). It is 
suggested that a company should only collect... 

... to those who had physical access to it. But today an HRIS uses a 
relational database to store and categorize different data files 

(Hubbard, Forcht, and Thomas, 1998). Relational databases not only allow 
the user to access and merge information via a common element like 
name or employee identification number, it also allows this information 
to be accessed from different locations within the company. . . 
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...TEXT: the interface 
registration system. 

(Photograph Omitted) 



engine and sends 



information back to the 



If the patient does not appear on the list, the user enters information 
into the EMerge database • EMerge assigns a medical record number. The 
registration process is the only place in the enterprise where a medical 
record number can be assigned. 

Inbound TSS API calls are made to EMerge from workstations... 

. . . into EMerge from the existing Eclipsys 7000 registration system. Out of 
more than one million patient records, over 75,000 possible duplicate 
records were reviewed between January and September. More than 35,000 of 
these records were identified as duplicates and were merged 
reducing the EMerge database to more than 1.6 million records. As the 
project grew to include multiple implementations, UAB decided additional 
resources were needed. The healthcare consulting firm, Insource 
Management Group, was enlisted to provide project management. 



Phase 2: As of. 
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. . .TEXT: enterprise. 

Yet, the enterprisewide indexing application itself is often expected to do 
a lot: receive patient or member information from many syss terns, assign 
unique indentifiers to each individual, merge records pertaining to the 
same person, update its database , remove duplicate records , 
correleate a patient 's records throughout the enterprise, keep the source 
system's identifier or pointers to it ... automatically merges them. 

"'Physically,' the records are not merged within the global master person 
index database , so that if it is later determined that these records are 
not for the same patient , the records can be 'unmerged' and restored to 
the original state," Frede noted. "We feel... 

...s own network that person's data is stored. "We take the data into our 
database and we link it to a single patient across all systems," Frede 
says. "We're not replacing any existing system. We consider ourselves... 
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. . .adopted into other 

hospitals across Hong Kong. 

Title: "LifeCode: A Natural Language Processing System for 

Medical 

Coding and Data Mining " 
Submitted by: A-Life Medical / Inc. (San Diego, California. USA) 
Summary: The LifeCode Expert System (patent pending) is a 

Natural 

Language Processing (NLP) based engine that extracts and 
provides useful data for the Emergency Medicine 

clinical 

specialty. Its outputs provide an integrated 

illustration 

of the patients 1 medical condition ( s ) , course of 

treatment 

and disposition while intelligently deleting 

redundant 

information /• combining complementary information 

to create 

the integrated picture, and assessing the medical 

risks and 

service. LifeCode also has a "self-awareness" feature 

that 

recognizes its own limits... 
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...TEXT: Cleanup: Tracks and deletes files you no longer need. 

* Setup Monitor: Tracks new program installations. 

* Duplicate File Finder : Finds duplicates by name/ date /size 
combinations , size filters. 

* Disk Data: Finds folders and files; also gives a report of disk space... 
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...TEXT: to bring together records derived from a standard LC cataloging in 
a minimum of processing time . A second phase processed records that 
didn't match on the minimum set of data elements through the remainder of 
the algorithm, comparing the... 



25/3,K/l5 (Item 15 from file: 15) 

DIALOG (R) File 15:ABI/Inform(R) 

(c) 2004 ProQuest Inf o&Learning . All rts. reserv. 
00725639 93-74860 

Cleanup and Deduplication of an International Bibliographic Database 

Toney, Stephen R. 

Information Technology & Libraries vllnl PP: 19-28 Mar 1992 
ISSN: 0730-9295 JRNL CODE: J LA 
WORD COUNT: 5235 

...TEXT: trail of great value when programming errors or additional cleanup 
requirements were noted. Furthermore, each duplicate record merged 
was noted in its master record by number and date • Both these ideas are 
recommended . 

Finally, never underestimate the time it takes for a project... 
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. . . file (same name and size) ; 

* identical copies of the same file (same name, size, and contents ) ; 

* unexpected identical files (different names but same size and 
contents ) . 

The Comparison options box is where you make your selections . 

Either filename or file size must be ... filenames ; wildcards are not 
allowed. The list of extensions starts off containing . and .TMP, both 
commonly used for temporary files . (It's quite common to find 
temporary files whose contents are identical to those of the 
corresponding permanent files.) Again, wildcards are not allowed. 

If you wish. . . 
...that were used in comparing files. 

You'll probably be amazed at the number of duplicate files 
Dupeless finds the first time you use it. And even though the prices of 
immense hard disks continue to drop... on a variety of definitions, and 
letting it do so in something less than geologic time . Speedily matching 
up files with identical contents was a separate challenge, solved in 
part by having Dupeless defer the actual file-contents... 

...all of the files in the temporary list as a group of duplicates—unless 
the Contents option has been selected. 
Finding Identical Files 
If the Contents option is selected, Dupeless must go through another 
step before declaring a set of files... 
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... ready to go. The program seamlessly accesses, indexes, and stores 

file configurations, mailing output, and dupe ( duplicate record ) 
information . 

Sorting, combining , and packaging (called presorting) mail for 
rapid disbursement is a time -consuming task — no matter who does it. But 
the USPS has long offered substantial postage... 

...are simultaneously checked, but the program will allow you to assign any 
number of field combinations - Searches will locate exact duplicates ( 
records whose contents match down to the letter) as well as the more 
elusive near-dupes (records that, due... 
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...ABSTRACT: a window that lists only duplicated files; DupLocator 
examines only file names and not their contents , forcing the user to 
match information to weed out true duplicates . Each file ' s disk path 
is displayed in a large window below the main window. DupLocator has... 
Quadra 700. 

Seeing doubles. The View Duplicate Files command creates a window 
that lists only duplicated files . Finding the true duplicates then 
becomes the main task. DupLocator doesn't examine the contents of a file 
duplicates are based solely upon the file name. 

The main window displays. . . 
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key word. Also, an ascending or descending Windows Sort can sort 
bys tring, number, or date . 

Un-Dupe is a " remove duplicates" capability that looks for 
duplicate information in a merged database. Envelopes and Shippling 
Label Printing automatically recognizes the address portion of a window and 
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... of the products tested allow you to select a record or file and 

then to delete it. We deleted multiple records by removing all 
customers older than a fixed entry date . 

Infrequently, you may find that you need to make changes in your data 
files that . . . 
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... either to the hard drive or a floppy diskette. Results from several 

searches can be combined in a single file ( duplicate records are 
automatically removed ) and the results can be printed at a later date . 
The SDI feature has been enhanced to allow batch processing of complex 
queries which include. . . 



?ds,. 

Description 

DUPLICAT? OR DUPE? ? OR COMMON? OR SAME? OR REDUNDAN? OR R- 
EPLICA? OR REPEAT? OR IDENTICAL? OR DOUBLE? OR ANALOGOUS? OR - 
SECOND OR PARALLEL? OR MULTIPLE? 

RECORD OR RECORDS OR DATA OR FILE OR FILES OR INFORMATION 
OR CITATION?. 

RECORD OR RECORDS OR DATA OR FILE OR FILES OR INFORMATION 
OR CITATION? 

REMOV? OR DELETE? OR DELETING? OR ERASE? ? OR ERASING? OR - 
DEDUPE? ? OR CHECK? OR FIND???? OR IDENTIF? OR DETECT? OR C- 
OMPAR???? OR MATCH? OR PREVENT? 
SI (3N) S3 
S5 (5N) S4 

MERGE? OR MERGING? OR CONSOLIDAT? OR COMBIN? OR JOIN??? OR 
GROUP? OR COMPRESS? OR COMPOSITE? 
S7 (5N) S5 
S6 AND S8 

MEDICAL? OR PATIENT? OR HEALTH? OR HOSPITAL? OR CLINICAL? 

510 AND S9 

511 AND IC=(G06F-007/00 OR G06F-017/30) 
S9 AND S10/TI,AB 
S13 AND IC=G06F? 

512 NOT S14 

File 348: EUROPEAN PATENTS 1978-2004/Mar W03 

(c) 2004 European Patent Office 
File 349:PCT FULLTEXT 1979-2002/UB=20040325,UT=20040318 

(c) 2004 WIPO/Univentio 

9 



Set 


I terns 


SI 


1559170 


S2 


721154 


S3 


723805 


S4 


1427234 


S5 


i 

153441 


S6 


21748 


S7 


1255200 


S8 


i 

12904 


S9 


3483 


S10 


300852 


Sll 


510 


S12 


36 


S13 


52 


S14 


25 


S15 


34 


?show 


files 



?t.£l4/3,k/l, 2,10, 15,16, 




14/3, K/l (Item 1 from file: 348) 

DIALOG (R) File 34 8: EUROPEAN PATENTS 

(c) 2004 European Patent Office. All rts. reserv. 



01535460 

A method for generating a user interface 

Verfahren zur Erzeugung einer Benutzerschnitts telle 

Procede pour la creation d f une interface utilisateur 

PATENT ASSIGNEE: 

SCA Hygiene Products AB, (2569621), Backstensgatan 5, Molndal, 405 03 
Goteborg, (SE), (Applicant designated States: all) 
INVENTOR: 

Kling, Asa, Alegardsgatan 265, 431 50 Molndal, (DK) 
Svensson, Bjorn, Klarinettvagen 4, 435 41 Molnlycke, (SE) 

LEGAL REPRESENTATIVE: 

Plougmann & Vingtoft A/S (101177), Sundkrogsgade 9, P.O. Box 831, 2100 
Copenhagen, (DK) 

PATENT (CC, No, Kind, Date) : EP 1280085 Al 030129 (Basic) 

APPLICATION (CC, No, Date): EP 2002078062 020725; 

PRIORITY (CC, No, Date) : US 911733 010725 

DESIGNATED STATES: AT; BE; BG; CH; CY; CZ ; DE; DK; EE; ES ; FI ; FR; GB; GR; 

IE; IT; LI; LU; MC; NL; PT; SE; SK; TR 
EXTENDED DESIGNATED STATES: AL; LT; LV; MK; RO; SI 
INTERNATIONAL PATENT CLASS: G06F-017/60 
ABSTRACT WORD COUNT: 123 
NOTE: 

Figure number on first page: 4 

LANGUAGE ( Publication, Procedural, Application) : English; English; English 
FULLTEXT AVAILABILITY: 

Available Text Language Update Word Count 

CLAIMS A (English) 200305 849 

SPEC A (English) 200305 6135 

Total word count - document A 6984 

Total word count - document B 0 

Total word count - documents A + B 6984 



INTERNATIONAL PATENT CLASS: G06F-017/60 



. . .ABSTRACT Al 

A method for generating a user interface for displaying data, such as 
healthcare product consumption and cost information, comprises 
allocating a plurality of institutions, e.g. hospitals or nursing 
homes, into groups. Groups may be altered, and time periods and 
currencies may. . . 

...SPECIFICATION group of the plurality of groups of institutions, 
identification data of institutions contained in that group , 

- creating a second set of data in response to a second user input, 
the second set of data representing at least an identification of 
the institution group, the time period, and the currency type selected by 
the user... group of the plurality of groups of institutions, 
identification data of institutions contained in that group , 

- create a second set of data in response to a second user input, the 
second set of data representing at least an identification of the 

institution group, the time period, and the currency type selected by the 
user. . . 

...CLAIMS group of the plurality of groups of institutions, identification 
data of institutions contained in that group , 

- creating a second set of data in response to a second user 
input, the second set of data representing at least an 
identification of the institution group, the time period, and the 
currency type selected by the user. . . 
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- create a second set of data in response to a second user 
input, the second set of data representing at least an 
identification of the institution group, the time period, and the 
currency type selected by the user. . . 
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. .ABSTRACT A2 

An automated collection and analysis patient care system and method 
for ordering and prioritizing multiple health disorders to identify an 



\ndex disorder is desc^fcbd. A plurality of monitoring are retrieved 

from a database. Each or the monitoring sets include stored measures 
relating to patient information recorded and derived on a substantially 
continuous basis. A patient status change is determined by comparing at 
least one stored measure from each of the... 



.least one other stored measure with both stored measures relating to the 
same type of patient information. Each patient status change is 
ordered in temporal sequence from least recent to most recent. A 
plurality of health disorder candidates categorized by quantifiable 
physiological measures of pathophysiologies indicative of each respective 

health disorder are evaluated and the health disorder candidate with 
the pathophysiology most closely matching those patient status changes 
which occurred least recently is identified. 

.SPECIFICATION or improvement of cardiovascular disease. Additionally, 
the Nappholz device is unable to automatically self-reference multiple 
data points in time and cannot detect disease regression. Also, the 
Nappholz device must be implanted and cannot function as an external ... as 
Patient 1, serial peer measures, such as peer measure Xo through Xn) ) , 
represent the same type of patient information monitored over time. 
The combined peer measures for all patients can be categorized into a 
health disorder- or disease-matched. . . 



...CLAIMS each patient status change for each stored measure to the 
stickiness indicator relating to the same type of patient 
information as the stored measure being compared ; and 
the analysis module (207) determining a revised program of patient 
diagnosis or treatment responsive ... each patient status change for 
each stored measure to the stickiness indicator relating to the same 

type of patient information as the stored measure being compared 
; and 

determining a revised program of patient diagnosis or treatment 
responsive to each patient status... 
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Detailed Description 

Claims 
English Abstract 

A network (100/160/180) for mediating the peer-to-peer transfer of 
digital patient medical data includes a plurality of distributed 
agents (180A, 180B, 180E, 180F, 180G, 180H) each associated with a 
health care provider and connected to a central system (180C) . 
Periodically the agents (180A, 180B, 180E, 180F, 180G, 180H) collect 
local information relating to patient medical files and/or data 
streams, for example diagnostic images and associated reports, and 
process that . . . 

. . .where they are parsed and the attributes are stored on the central 
system (180C) in patient records with records from the same 
patient grouped together whenever possible. Registered users can 
search the central system (180C), even in the absence of a unique 
identifier, to identify patient records pointing to the remote patient 

medical files . Upon finding a patient medical file, the invention 
provides a streamlined process for communicating access authorization 
from the patient to the hospital or facility storing the medical 
files. Once patient authorization is received, secure processes are 
provided for transferring the data in its entirety to... 

French Abstract 

. . . reseau (100/160/18 0) destine a induire le transfert d'egal a egal de 
donnees medicales numeriques d ' un patient , ces donnees comprenant une 
pluralite d'agents distribues (180A, 180B, 180E, 180F, 180G, 180H) , 
chacun . . . 

...180A, 180B, 180E, 180F, 180G, 180H) collectent les informations locales 
relatives aux fichiers medicaux des patients et/ou aux trains de 
donnees, par exemple, des images de diagnostic et des rapports... 

...analyses et les attributs sont stockes sur le systeme central (180C) 
dans des registres de patients , avec des enregis trements du meme 
patient regroupes chaque fois que cela est possible. Les utilisateurs 
enregistres peuvent rechercher le systeme central (180C), meme en 
1' absence d'un identif icateur unique, afin d' identifier des 
enregistrements de patients designant les fichiers medicaux a distance 
des patients . Pour rechercher un fichier medical d'un patient , on 
peut utiliser un processus en continu de 1 ' invention permettant de 
communiquer 1 1 autorisation d'acces du patient a l f hopital ou a 
1 ' installation stockant les fichiers medicaux. A la reception de 
1 1 autorisation du patient , des processus de securite sont mis en place 
pour transferer les donnees dans leur totalite. . . 

Detailed Description 

. . . not propose an actual solution, but merely suggests that it should be 
possible to use multiple pieces of patient information to match 
patients . 



16 



While this Channin proposal represents a proposal for 
cross-institutional, peer-to-peer. . . 



...containing attributes ^relating to a remotely stored patxent medical 
file; extracting the attributes from the second metadata file ; 
comparing , by the computer, the attributes from the second metadata file 
to the attributes stored in 

Claim 

containing attributes relating to a remotely 
stored patient medical file; 

extracting the attributes from the second metadata file ; 

comparing , by the computer, the attributes from the second metadata 
file to the 

attributes stored in... 
. . .the 

database entry. 
47 

. A method in accordance with claim 26 wherein the step of comparing 
includes the steps ofparsing the second metadata file to extract the 
attributes contained therein; and matching at least one attribute 
extracted from the... 

...determining if the second metadata file relates to the database entry 
includes determining if the second metadata file is a match , a non- 
match or indeterminate with respect to the database entry. 

29 A method in accordance with claim 28 and ftirther comprising the steps 
of: if the second metadata file matches the database entry, 
updating the database 
entry; and 

if the second metadata file is a non- match with the database entry, 
creating a new database entry based on the attributes in the. . . 

. . . containing attributes relating to a remotely 
stored patient medical file; 

extracting the attributes from the second metadata file ; 

comparing the attributes from the second metadata file to the 
attributes stored 
in the database entry; and 

determining if the second metadata file. . . 

...determining if the second metadata, file relates to the database entry 
includes determining if the second 
49@' 

metadata file is a match , a non- match or indeterminate with respect 
to the database entry. 

36 A system in accordance with claim 35 and ftirther comprising the steps 
ofif the second metadata file matches the database entry, updating 
the database 
entry; and 

if the second metadata file is a non- match with the database entry, 
creating a new database entry based on the attributes in the... 

...containing attributes relating to a remotely 
stored patient medical file; 

extracting the attributes from the second metadata file ; 

comparing the attributes from the second metadata file to the 
attributes stored 
in the database entry; and 
1 0 detennining if the second. . . 

. . . relating to a 

remotely stored patient medical file; 

means for extracting the attributes from the second metadata file ; 
means for comparing the attributes from the second metadata file to 
the 

attributes stored in the database entry; and 



means for determining, 1 




the computer. . . 




.containing attributes relating to a remotely 
stored patient medical file; 

extracting the attributes from the second metadata file ; 

I 0 comparing , by the computer, the attributes from the second metadata 

file to the 

attributes stored in... 
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Detailed Description 



English Abstract 

In a healthcare system, a method consolidates patient related 
information from multiple , different sources by performing the 
following steps. Patient identification data identifying a particular 
patient is received. Messages are generated for communication to a 
corresponding information sources. The generated messages incorporate the 
particular patient identification data and a request for information 
concerning the particular patient . The messages are communicated to the 
corresponding information sources. Response messages are received 
containing requested information concerning the particular patient from 
the information sources. The requested information is sorted and merged 
from the response messages to provide data representative of consolidated 
patient related information. 

French Abstract 

...invention concerne un systeme de sante utilisant un procede destine a 
consolider des informations de patient en provenance de sources 
differentes multiples. Ce procede consiste a recevoir des donnees 
d 1 identification de patient permettant d' identifier un patient 



US, 



Claims 



particulier, a produii^^es messages en vue de leur complication a des 

sources d ' informations correspondantes , ces messages comprenant les 
donnees d ' identification du patient particulier ainsi qu ' une demande 
d 1 informations concernant ce patient particulier, a communiquer ces 
messages aux sources d ' informations correspondantes, a recevoir des 
messages de reponse contenant les informations demandees sur le patient 
particulier en provenance des sources d ' informations, puis a trier et 
fusionner les informations demandees... 



...de ces messages de reponse en vue d'obtenir des donnees representatives 
d ' informations de patient consolidees. 

Detailed Description 

services by many healthcare 
generate a correct match for over 90% of the stored patient information 
, generate a duplicate match for 5% to 10% of the stored patient 
information, and generate less than 5% of... of the large number of 
variables, some related to human interaction. 

Still other present systems combine the patient information from the 

multiple , different healthcare sources into a single view. The single 
view makes is very difficult to . . . a healthcare information system. 

3 

Summary of the Invention 

In a healthcare system, a method consolidates patient related 
information from multiple , different sources by performing the 
following steps. Patient identification data identifying a particular 
patient is... means to combine the various replies, and may include, 
without limitation, sorting, manipulating, formatting, purging duplicate 
data , merging , 

18 

and organizing the information. Hence, the server device 14 combines the 
received information in... 



Claim 

1 . A method for use in a healthcare system for consolidating patient 
related 

information from multiple different sources, comprising the steps of: 
receiving patient identification data identifying a particular patient; 
generating ... information and (e) enterprise providing service to said 
particular patient and 
includes the step of. , 

deleting redundant duplicate requested information . 

14 A method according to claim 1, including the step of 
said merging step includes . . . 

... in 

response to user command. 
28 

. A system for use in a healthcare system for consolidating patient 
related 

information from multiple different sources, comprising: 
an interface processor for, 

receiving patient identification data identifying a particular patient 
related information . 

17 A healthcare user interface system supporting user access to 
consolidated patient related information derived from multiple 
different sources and including a 
method comprising the steps of: 

receiving patient identification data identifying. . . 
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Detailed Description 

Claims 

English Abstract 

...the input dataset could be re-identified from that output dataset. 
Individual entities may include patients in a hospital or served by 
an insurance carrier, as well as voters, subscribers, customers, 
companies, or any. . . 

...data acquisition equipment, such as a biologic data sampling device, to 
prevent de- identification of patient or other confidential data 
acquired by the equipment. 

French Abstract 

. . . reidentif iee a partir dudit ensemble de donnees de sortie. Lesdites 
entites individuelles peuvent etre les patients d'un hopital ou des 
personnes beneficiant des services d'une societe d'assurance, ainsi... 

. . . un dispositif d ' echantillonnage de donnees biologiques, afin d'empecher 
la desidentif ication des donnees des patients ou d'autres donnees 
conf identielles acquises par 1 ' equipement . 

Detailed Description 

first record to said second fields and said corresponding second 
values of all of said second records ; and based on said comparing , 
extracting said first records and said first corresponding values of the 
highest priority first f ields . . . 3 52% 48% 
Total 33% 67% 

As mentioned above, there were two entries for the same person ( 
identifier #6) . Records with multiple occurrences belonging to a 
single person can be more easily identifiable. Consequently, not just the 
...time, as well as a count of the number of occurrences of each unique 
record identified within that stream of data over the same time 
period . 



t^^^capacity to tally various reco^^i 



1 5 Also included is tTre capacity to tally various recorrr identifiers , 
such . . . 



...the scrubbed data no longer identifying. 



For example, a data clearinghouse may buy personal claims data from 

multiple insurance companies and sell the combined data to 
pharmaceutical companies for marketing research. Regulations require that 
the data be de-identified. . . 

Claim 

. . . first record to said second fields and said corresponding second 
values of all of said second records ; and 

based on said comparing , extracting said first records and said first 
corresponding values of the highest priority first fields... 
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English Abstract 

...language understanding is described which provides for the generation 
of concept codes from free-text medical data. A probabilistic model of 
lexical semantics, in the preferred embodiment of the invention 
implemented. . . 

Detailed Description 

quality assurance, analyses done for management purposes and in a 
variety of focused reports that combine information from multiple 
sources, but -is not readily accessible to a human reader unless 
translation of the coded ... extracting its relevant information, 
oracle text in" calls "admission-text-out 11 which exists in the same 
file . That routine checks that the patient is of the right type, i.e., 
"inpatient". It then calls the... 



?ds 



DELETE? OR DELETING? OR ERASE? 
CHECK? OR FIND???? 



? OR ERASING? OR 



Description 

DUPLICAT? OR DUPE? ? OR COMMON? OR SAME? OR REDUNDAN? OR R 
EPLICA? OR REPEAT? OR IDENTICAL? OR DOUBLE? OR ANALAGOUS? OR 
SECOND OR PARALLEL? 

RECORD OR RECORDS OR DATA OR FILE OR FILES OR INFORMATION 
OR CITATION?. 

RECORD OR RECORDS OR DATA OR FILE OR FILES OR INFORMATION 
OR CITATION? 

REMOV? OR 
DEDUPE? ? OR 
SI (3N) S2 
S1(3N)S3 

56 (3N)S4 

MERGE? OR MERGING? OR CONSOLIDAT? OR COMBIN? OR JOIN??? OR 
GROUP? 

58 (3N)S6 

59 AND S7 

MEDICAL? OR PATIENT? OR HEALTH? OR HOSPITAL? 
Sll AND S10 

S10 AND IC=(G06F-007/00 OR G06F-017/30) 

57 AND Sll 

S14 AND IC=(G06F-007/00 OR G06F-017/30) 
S7 AND IC= (G06F-007/00 OR G06F-017/30) 
S16 AND MC=(S05-G02G OR T01-J05B4M OR T01-J06A1) 
?show files 

File 347:JAPIO Nov 197 6-2003/Nov (Updated 040308) 

(c) 2004 JPO & JAPIO 
File 350:Derwent WPIX 1963-2004/UD, UM &UP=200419 

(c) 2004 Thomson Derwent 



Set 


Items 


SI 


4328503 


S2 


2897100 


S3 


2897155 


S4 


i 

1851354 


S5 


129201 


S6 


129206 


S7 


1995 


S8 


2139702 


S9 


i 

2757 


S10 


45 


Sll 


442130 


S12 


0 


S13 


2 


S14 


26 


S15 


2 


S16 


80 


S17 


7 



It S13/5/1-2 



13/5/1 (Item 1 from file: 347) 

DIALOG (R) File 347:JAPIO 

(c) 2004 JPO & JAPIO. All rts. reserv. 



06795294 **Image available** 

INFORMATION RETRIEVAL DEVICE, INFORMATION COMPRESSING METHOD FOR 
INFORMATION RETRIEVAL DEVICE, AND RECORDING MEDIUM 



PUB. NO. : 
PUBLISHED: 
INVENTOR (s) : 
APPLICANT (s) 
APPL. NO. : 
FILED: 
INTL CLASS: 



2001-022775 [JP 2001022775 A] 
January 26, 2001 (20010126) 
I TO MAS AO 

MATSUSHITA ELECTRIC IND CO LTD 
11-194740 [JP 99194740] 
July 08, 1999 (19990708) 
G06F-017/30 ; G06F-005/00; H03M-007/30 



ABSTRACT 

PROBLEM TO BE SOLVED: To suppress the capacity of an index file by 
compressing index information more efficiently and to enable high-speed 
retrieval using the index file. 

SOLUTION: When entries of index information are successively identical , 
this information compressing method deletes by an index information 
compression part 107 the entries except only one entry, puts (n) successive 
pieces of index information with the same document number in one group as 
to index information in the same character chain, and generates 
compressed index information having the document number compressed into 
one. When byte data of position numbers are not larger than a reference 
value ( O&times ; Of ) , the low-order four bits of each of the byte data of the 
position numbers are connected to generate compressed index information 
having the position number converted into 1-byte data. 
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ABSTRACT 

PROBLEM TO BE SOLVED: To assure the consistency of data despite their 
updating by merging a duplicated updating history information file 
and ' checking the consistency of reflection information stored in the 
file. 



SOLUTION: When an application 122 updates a database 111, a DBMS 110 
outputs the updating information 113 to an updating history information 
file 112 for recovery after the base 111 is updated. A reflection 
information duplicating device 101 produces the reflection information 106 
from the information 113 and stores the information 106 in a reflection 
information file 104. The device 101 transfers the information 106 to a 
reflection information duplicating device 102 as designated. The device 102 
of a duplicating destination database system 121 receives the transferred 
information 106 and stores it in a duplicated updating history information 



file 105. Then a refl^™ion information reflecting dev^A 103 checks the 

consistency of the reUrection information 107 stored in^Phe file 105 and 
acquires the information 107 to updating it by reflecting it on a database 
114. 
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Abstract (Basic) : EP 1387292 Al 

NOVELTY - Establishing or modifying a data source comprising 
entries related to sequences, comprises retrieving a sequence contained 
in the data set and generating a hash key by applying collision-free 
hash function; and for each data set, adding information for retrieving 
information from the data set to entry in reference data source related 
to a hash key generated from a sequence in the data set. 

DETAILED DESCRIPTION - Establishing or modifying a data source 
comprising entries related to biological sequences that are 
non-redundant with regard to the sequences on the basis of data sets of 
one or more basic data sources, each of data sets comprising biological 
sequence, comprises retrieving for one or more data sets a biological 
sequence contained in the data set and generating a hash key from the 
biological sequence thus retrieved by applying a collision-free hash 
function. The hash function maps the data representing the sequence 
onto a message of a length shorter than the length of the original data 
representing the sequence. For each of the data sets, information is 
added for retrieving information from the data set to an entry in a 
reference data source uniquely related to the hash key generated from 
the sequence contained in the data set. A new entry in the reference 
data source is provided which comprises one unique hash key and 
information for retrieving the data set or data sets comprising the 
sequence from which the hash key was generated, if the reference data 
source does not comprise an entry related to the hash key, such that 
each entry in the reference data source is uniquely identified by a 
hash key generated from a sequence. 

INDEPENDENT CLAIMS are also included for: 

(a) a computer system comprising a reference data source and/or a 
non-redundant unified database; software for accessing a database 



related to biologica^^equences , each entry in the da^Hase being 
uniquely related to o^? biological sequence; and software for accessing 
a reference data source comprising entries, each uniquely related to a 
biological sequence and having as unique identifier a hash key created 
by a collision-free hash function mapping the sequence on a data string 
of shorter length; and 

(b) a method of comparing a non-redundant data source comprising 
entries, each entry being uniquely related to a biological sequence, to 
data sets of one of more basic data sources related to the sequences, 
comprising providing a first reference data source comprising entries, 
each corresponding to exactly one entry in the reference data source, 
each entry in the reference data source having as unique identifier a 
hash key obtainable by a collision-free hash function mapping the 
sequence to which the entry relates on a data string of shorter length; 
providing a second reference data source comprising entries, each of 
which is related to exactly one biological sequence and has as a unique 
identifier the hash key created by the hash function, when applied on 
the corresponding sequence, where each entry comprises information 
related, to one or more data sets related to the sequence in one of the 
basic data sources; and comparing the first and second reference data 
source as to whether entries in the two reference data sources relating 
to the same hash key comprise different information and/or as to 
whether an entry in the first or second reference data source is 
missing in the respective other reference data source. 

USE - For establishing or modifying a data source comprising 
entries related to biological sequences that are non-redundant with 
regard to the sequences on the basis of data sets of one or more basic 
data sources, each of data sets comprising biological sequence 
(claimed) . 

ADVANTAGE - The method establishes and maintains a non-reductant 
database with less memory consumption and more algorithmic efficiency, 
which ideally would also allows for incremental updates. 
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Data family record managing method, involves adding designated record to 
family of records when determined that designated record is not duplicate 
of data records in family, and setting indicator to indicate relationship 
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Inventor: MEINIG K 

Number of Countries: 001 Number of Patents: 001 
Patent Family: 
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Abstract (Basic) : US 20030167253 Al 

NOVELTY - The method involves adding a designated record to the 
potential family of records when it is automatically determined that 



the designated recor^k not a duplicate of the recor^Bln the family. 
An indicator is automlrcically set in each of the data^records in the 
potential family of records to indicate a family relationship between 
the records . 

DETAILED DESCRIPTION - INDEPENDENT CLAIMS are also included for the 
following : 

(a) a computer-readable memory medium 

(b) a record management system. 

USE - Used for identifying relationship among data records. 
ADVANTAGE - The method automatically determines missing information 
from a particular data record and also deletes duplicate records 

DESCRIPTION OF DRAWING (S) - The drawing shows an example flowchart 
of a routine for importing and automatically de-duplicating a file of 
new data records . 
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Customer database generation apparatus determines equivalence degree 
between address and name of customer, based on data equivalence condition 
of data equivalence determination definition file 

Patent Assignee: NIPPON TELEGRAPH & TELEPHONE CORP (NITE ) 
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Abstract (Basic) : JP 2003173345 A 

NOVELTY - The selection unit selects the collation data from 
received address and name information of customer. A collation unit 
collates the address and name based on the collation rule. The 
equivalence value of collation result is calculated. A collation unit 
(15) determines equivalence degree between address' and name of customer 
based on data equivalence conditions of data equivalence determination 
file. 

DETAILED DESCRIPTION - INDEPENDENT CLAIMS are also included for the 
following : 

(1) database generation method; and 

(2) database generation processing program. 

USE - For upgrading of databases such as correction of declared 
mistake, declared unification and checking of duplicated data . 

ADVANTAGE - Exact collation is enabled even if the information of 
the address or name is imperfect, duplicated data is removed and 
upgrading of a database is enabled. 

DESCRIPTION OF DRAWING (S) - The figure shows the block diagram of 
database generation apparatus. (Drawing includes non-English language 
text) . 

analysis unit (12,13) 

collation unit (15) 
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Database information management for duplicate records removal 
from marketing and business analysis data sets, includes comparing two 
data sets to detect duplicate records by dividing first data set into U 
subsets and second into V subsets 
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Abstract (Basic) : GB 2378789 A 

NOVELTY - A method of comparing two data sets, each of which 
contains a number of records, to detect duplicate records, comprising: 
dividing the first data set into U subsets; dividing the second dataset 
into V subsets, at least one of U and V being grater than 1; and 
comparing each subset , at least some or all of the comparisons being 
carried out simultaneously in parallel by parallel processors. The 
subsets of the first and second data sets contain approximately equal 
number of records. 

DETAILED DESCRIPTION - AN INDEPENDENT CLAIM is also included for a 
data processing system. 

USE - Database information management for the removal of 
duplicate records from large data sets, particularly for client 
profiling, marketing and business analysis. 

ADVANTAGE - Provides a method for removing duplicate records 
from a data set in a more efficient and economical manner with regard 
to the costs involved in processor memory and time, while ensuring that 
performance doesn't depend on the relevance and accuracy of the chosen 
sorting key and that the comparison is not total. 

DESCRIPTION OF DRAWING (S) - The drawing shows a block diagram that 
illustrates a central processor receiving data sets S and T and divides 
these into subsets. 
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Database management method involves identifying data cells which 
satisfies input data filtering criteria, and sorting data sets of 
identified cells based on priority list for display 
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Abstract (Basic) : US 20020143780 Al 

NOVELTY - The order of data filtering criteria input by the user is 
recorded and a sorting priority list defining a sorting priority for 
data cells in data table is generated. The data cells satisfying the 
filtering criteria are identified and displayed interleaved between 
non-identified cells. The data set of identified cells are sorted based 
on priority list and graphically displayed to the user by highlighting 
it. 

DETAILED DESCRIPTION - An INDEPENDENT CLAIM is included for 
database management system. 

USE - For managing database storing business information, 
educational information, personal information, medical information and 
tax data. 

ADVANTAGE - The sorting process enables the user to find desired 
data on the same screen. 

DESCRIPTION OF DRAWING (S) - The figure shows the block diagram of 
data management system. 
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Abstract (Basic) : WO 200169413 A2 

NOVELTY - An array of data sharing a common value for a particular 
data field is selected. An append tag defining the common value of the 
data field, is inserted in a append section preceding the data array 
such that each data inherit the common value, when the selected data 
file is read. 

DETAILED DESCRIPTION - INDEPENDENT CLAIMS are also included for the 
following : 

(a) Data file; 

(b) Computer system; 

(c) Computer program product 

USE - For storing large array of data generated in digital imaging 
and database compilation in computers. 

ADVANTAGE - As the append tag defining common value of data field 
shared by the data array is included, redundancy in a data file 
is removed , markup of data is minimized, files are parsed and 
transmitted more quickly, less storage memory is required, additional 
information can be added without designing special parser. 

pp; 2 0 DwgNo 0/0 

Title Terms: MULT I ; RECORD; DATA; STORAGE; PROCESS; COMPUTER; INSERT; 

APPENDAGE; TAG; DEFINE; COMMON; VALUE; DATA; FIELD; SHARE; DATA; ARRAY; 

APPENDAGE; SECTION; PRECEDE; RESPECTIVE; DATA; ARRAY 
Derwent Class: T01 

International Patent Class (Main) : G06F-007/00 ; G06F-017/00 
International Patent Class (Additional): G01N-033/48; G01N-033/50; 

G06F-017/30 ; G06F-019/00 
File Segment: EPI 



17/5/7 (Item 7 from file: 350) 

DIALOG (R) File 350: Derwent WPIX 

(c) 2004 Thomson Derwent. All rts . reserv. 

011145597 **Image available** 

WPI Acc No: 1997-123521/199712 

XRPX Acc No: N97-101810 

Correlation checking method for common data in online processing 
system - involves performing simultaneous updation of common data stored 
by each section, if correlation check result is judged to be normal 

Patent Assignee: HITACHI LTD (HITA ) 

Number of Countries: 001 Number of Patents: 001 

Patent Family: 

Patent No Kind Date Applicat No Kind Date Week 

JP 9006660 A 19970110 JP 95154285 A 19950621 199712 B 

Priority Applications (No Type Date) : JP 95154285 A 19950621 
Patent Details : 

Patent No Kind Lan Pg Main IPC Filing Notes 
JP 9006660 A 6 G06F-012/00 

Abstract (Basic) : JP 9006660 A 

The method involves judging need for updation of each section, when 
data is inserted into each section. When updation is needed, 
correlation check between these sections is performed. 

When the check result is normal, common data stored by each 
section is updated simultaneously. 



»> ADVANTAGE - Rest^ns duplication of data inserti^^ in relative 

sections. Improves p^Wision of database between each^cction. 
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(record$2 with match$5) and 
criteria ) and 

( (electro$2cardiograph or ECG) 
or (electro$2encepholograph or 
EEG )) and (patient$2 and 
clinical) 


US PAT 


2004/05/26 
08 :33 


47 


18681 


707/100, 102, 104 .l.ccls. or 
707/2 00-2 03, 2 05, 206 .eels . 


US PAT; 
US-PGPUB 


2004/05/27 
07 :22 


48 


15456 


705/1-3 .eels . or 
709/200-2 03, 223, 224 .eels. 


US PAT; 
US-PGPUB 


2004/05/27 
07 :22 


49 


1 


(identif$5 adj2 record$2 ) and 
( (appl$5 or us$4) and 
(record$2 with match$5) and 
criteria ) and (laboratory 
adj2 test$2 and result$2 ) and 
(physical adj2 
characteristic$2) and 
(patient$2 and clinical) 


US PAT 


2004/05/27 
11:35 


50 


7 


(identif$5 adj2 record$2 ) and 
( (appl$5 or us$4) and 
(record$2 with match$5) and 
criteria ) and (laboratorv 
adj2 test$2 and result$2 ) and 
(patient$2 and clinical) 


US PAT 


2004/05/27 


51 


18581 


7 0 7/1 - R ft-1 n rrle nr 
707/100, 102,104. l.ccls. or 
707/200-2 03, 2 05, 206. eels. 


US PAT; 
US-PGPUB 


2004/05/27 
13 :46 


52 


18681 


7 0 7/1 - R Q.l A r«r»1c! r\*r 

707/100, 102, 104. l.ccls. or 
707/200-2 03, 2 05, 206. CCls. 


US PAT; 
US-PGPUB 


2004/05/27 
13 :46 


53 


12350 


705/1-3 .eels . or 
709/200-203 , 223, 224 .eels . 


US PAT; 
US-PGPUB 


2004/05/27 
13 :47 


54 


5121 


identif$5 adj2 record$2 


US PAT 


2004/05/27 
13 :47 


55 


2459 


(appl$5 or us$4) and (record$2 
with match$5) and criteria 


US PAT 


2004/05/27 
13 :47 





Hits 


Search Text 


d2T 


Time Stamp a 


56 


15456 


705/1-3 -eels, or 
709/200-203 ,223 ,224 *ccls . 


USPAT; 
US-PGPUB 


2004/05/27 
13 :47 


57 


5132 


identif$5 adj2 record$2 


USPAT 


2004/05/27 
13 :47 


58 


28834 


allerg$4 


USPAT 


2004/05/27 
13 :48 


59 


426 


ventilation with data 


USPAT 


2004/05/27 
13 :48 


60 


2469 


(appl$5 or us$4) and (record$2 
with inatch$5) and criteria 


USPAT 


2004/05/27 
13 :48 


61 


28834 


allerg$4 


USPAT 


2004/05/27 
13 :48 


62 


426 


ventilation with data 


USPAT 


2004/05/27 
13 :48 


63 


9477 


blood with oxygen 


USPAT 


2004/05/27 
13 :49 


64 


3 9210 


blood with (oxygen or 
Dressure) 


USPAT 


2004/05/27 
13 :49 


65 


762 


infusion adj2 pump$2 and pulse 


USPAT 


2004/05/27 
13 : 50 


66 


762 


infusion adj2 pump$2 and pulse 


USPAT 


2004/05/27 
14:04 


67 


3 


(identif$5 adj2 record$2 ) and 
( (appl$5 or us$4) and 
(record$2 with match$5) and 
criteria ) and allpraS4 and 
(blood with (oxygen or 
pressure) ) 


USPAT 


2004/05/27 


68 


3535 


infusion adj2 pump$2 


USPAT 


2004/05/27 
14:05 


a q 
o y 


J. 


(identif$5 adj2 record$2 ) and 
( (appl$5 or us$4) and 

V -L. V— LV J_ \JL tjt £j Wi. L.11 1 1 Id L. Lll »f» «J / C111L1 

criteria ) and (ventilation 
with data ) 


TTODAT 


2004/05/27 
14:49 


70 


6 


(identif$5 adj2 record$2 ) and 
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